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This application claims the benefit of United States provisional patent 
application serial number 60/157,923, filed October 6, 1999. The entire content 
of this provisional patent application is incorporated herein by reference. 

The invention disclosed herein was made in part with Government 
5 support under Grant DAMD 17-94-J-4234 awarded by the Department of 
Defense and POl CA32737 awarded by the National Institutes of Health. The 
Government may have certain rights to the invention. 

FIELD OF THE INVENTION 

10 This invention relates to nucleic acid and amino acid sequences of new 

genes that are differentially expressed in cells that overexpress the Her-2/neu 
oncogene. 

BACKGROUND OF THE INVENTION 

15 Cancers of the breast and ovary are among the leading causes of death 

among women. The cumulative lifetime risk of a woman developing breast 
cancer estimated to be 1 in 9. Consequently, understanding the origins of these 
malignancies as well as models for the identification of new diagnostic and 
therapeutic modalities is of significant interest to health care professionals. In 

20 this context, cancer cells have been shown to exhibit unique gene expression, and 
dozens of cancer-specific genetic markers, tumor antigens, have been identified. 

The human HER-2/zkw (c-erbB-2) proto-oncogene encodes a 
transmembrane receptor tyrosine kinase with extensive sequence homology to 
the epidermal growth factor receptor (EGFR) (Bargmann, C. I., Hung, M. C and 

25 Weinberg, R. A. (1986) Cell, 45(5), 649-57). Amplification and/or overexpression 
of HER~2/w// has been found in one-third , of human breast and one-fifth of 
ovarian cancers ( Slamon, D. J., Clark, G. M., Wong, S. G., Levin, W. J., Ullrich, 
A. and McGuire, W. L. (1987) Science, 235(4785), 177-82Slamon, D. J., 
Godolphin, W., Jones, L. A., Holt, J. A., Wong, S. G., Keith, D. E., Levin, W. J., 

30 Stuart, S. G., Udove, J, Ullrich, A. and et al. (1989) Science, 244(4905), 707-12). 
In addition, the HER-2///W alteration is associated with a poor clinical outcome 
in that women whose tumors contain it experience earlier disease relapse and 
shorter overall survival (Slamon, D. J., Godolphin, W., Jones, L. A., Holt, J. A., 
Wong, S. G., Keith, D. E., Levin, W. J., Stuart, S. G., Udove, J., Ullrich, A. and et 
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al. (1989) Science, 244(4905), 70742; O'Reilly. S. M, Barnes, D. M., Camplejohn, 
R. S, Bartkova, J., Gregory, W. M. and Richards, M. A. (1991) Br / Cancer, 63(3), 
444-6; Press, M. F., Pike, M. C, Chazin, V. R, Hung, G., Udove, J. A., 
Markowicz, M., Danyluk, J., Godolphin, W., Sliwkowski, M., Akita, R. and et al. 
5 (1993) Cancer Res, 53(20), 4960-70; Seshadri, R., Horsfall, D. J., Firgaira, F., 
McCaul, IC, Setlur, V., Chalmers, A. H., Yeo, R., Ingram, D., Dawkins, H. and 
Hahnel, R. (1994) Int J Cancer, 56(1), 61-5). Two hypotheses potentially account 
for this prognostic association. First, HER-2 / neu overexpression may be an 
epiphenomenon serving merely as a marker of aggressive breast cancers. 

10 Conversely, the alteration may be causal for the aggressive phenotype. 
Considerable circumstantial evidence now supports the latter possibility, with 
data suggesting that overexpression plays a direct causal role in the pathogenesis 
of the malignancies in which it occurs (Shih, C, Padhy, L. C, Murray, M. and 
Weinberg, R. A. (1981) Nature, 290(5803), 261-4; Di Fiore, P. P., Pierce, J. R, 

15 Kraus, M. H., Segatto, O., King, C. R. and Aaronson, S. A. (1987) Science, 
237(4811), 178-82; Hudziak, R. M, Schlcssinger, J. and Ullrich, A. (1987) Proc 
Natl Acad Sci USA, 84(20), 7159-63). 

The subtraction cloning technique termed differential hybridization, also 
known as plus/minus screening (St John, T. P. and Davis, R. W. (1979) Cell, 

20 16(2), 443-52), can be used to isolate genes which are differentially expressed in 
cells which overexpress HER-2 as compared to control cells. This approach has 
the advantage of comparing two human breast cancer cell lines which are 
identical except for HER-2//W/ overexpression allowing for a direct comparison 
of cDNAs derived from the two cell populations. As disclosed herein, using this 

25 approach, we identified a series of genes, either previously characterized or 
entirely novel, whose expression levels are altered in association with HER-2///^ 
overexpression. The evidence suggests that these genes might be mediators of 
the HER-2 overexpressing phenotype since we have confirmed their differential 
expression not only in human ovarian cancer cell lines which overexpress HER-2 

30 but also primary breast cancer specimens containing this alteration. 

The discovery of Her-2/ neu overexpression modulated proteins, and the 
polynucleotides which encode them satisfies a need in the art by providing new 
compositions which have potential in understanding and modulating disorders 
associated with cell proliferation. 
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SUMMARY OF THE INVENTION 

The present invention provides new Her-2/neu overexpression 
modulated proteins (including proteins having both new and known amino acid 
5 sequences such as novel splice variants of known proteins) hereinafter designated 
HOMPS. A first HOMPS protein is designated HI 7. The expression of HI 7 
increases in cells which overexpress Her-2/neu. A second HOMPS protein is 
designated C40. The expression of C40 decreases in cells which overexpress 
Her-2/neu. A third HOMPS protein is designated H41. The expression of H41 

10 increases in cells which overexpress Her-2/neu. A fourth HOMPS protein is 
designated H13. The expression of H13 increases in cells which overexpress 
Her-2/neu. A fifth HOMPS protein is designated HI 4. The expression of HI 4 
increases in cells which overexpress Her-2/neu. In addition, the present 
invention discloses a HOMPS related polynucleotide sequence designated H63. 

15 The expression of H63 increases in cells which overexpress Her-2/ neu. 

The invention provides polynucleotides corresponding or complementary 
to ail or part of the HOMPS genes, mRNAs, and/ or coding sequences, preferably 
in isolated form, including polynucleotides encoding HOMPS proteins and 
fragments thereof, DNA, RNA, DNA/RNA hybrid, and related molecules, 

20 polynucleotides or oligonucleotides complementary to the HOMPS genes or 
mRNA sequences or parts thereof, and polynucleotides or oligonucleotides that 
hybridize to the HOMPS genes, rnRNAs, or to HOMPS-encoding 
polynucleotides. Also provided are means for isolating cDNAs and the genes 
encoding HOMPS. Recombinant DNA molecules containing HOMPS 

25 polynucleotides, cells transformed or transduced with such molecules, and host- 
vector systems for the expression of HOMPS gene products are also provided. 
The invention further provides HOMPS proteins and polypeptide fragments 
thereof. The invention further provides antibodies that bind to HOMPS proteins 
and polypeptide fragments thereof, including polyclonal and monoclonal 

30 antibodies, murine and other mammalian antibodies, chimeric antibodies, 
humanized and fully human antibodies, and antibodies labeled with a detectable 
marker. 

Accordingly, the invention provides a substantially purified HOMPS 
having the amino acid sequence shown in Figure 2, Figure 4, Figure 6, Figure 9 
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or Figure 11. A typical embodiment of the invention provides an isolated and 
substantially purified polynucleotide that encodes HOMPS. In a particular aspect, 
the polynucleotide is the nucleotide sequence shown in Figure 1, Figure 3, Figure 
5, Figure 8 or Figure 10. The invention also provides a polynucleotide sequence 
5 comprising the complement of the nucleotide sequences shown in Figure 1, 
Figure 3, Figure 5, Figure 8 or Figure 10 or variants thereof. In addition, the 
invention provides polynucleotide sequences which hybridize under stringent 
conditions to the nucleotide sequences shown in Figure 1, Figure 3, Figure 5, 
Figure 8 or Figure 10. The invention further provides nucleic acid sequences 
10 encoding fragments or the complement of the polynucleotide sequences, as well 
as expression vectors and host cells comprising polynucleotides that encode 
HOMPS. 

The invention further provides methods for detecting the presence and 
status of HOMPS polynucleotides and proteins in various biological samples, as 
15 well as methods for identifying cells that express HOMPS. A typical embodiment 
of this invention provides methods for monitoring HOMPS gene products in a 
tissue sample having or suspected of having some form of growth deregulation 
such as cancer. 

BRIEF DESCRIPTION OF THE FIGURES 
20 Figure 1 shows a nucleic acid sequence of HI 7 (SEQ ID NO: 1). 

Figure 2 shows an amino acid sequence of HI 7 (SEQ ID NO: 2). 
Figure 3 shows a nucleic acid sequence of C40 (SEQ ID NO: 3). 
Figure 4 shows an amino acid sequence of C40 (SEQ ID NO: 4). 
Figure 5 shows a nucleic acid sequence of H41 (SEQ ID NO: 5). 
25 Figure 6 shows an amino acid sequence of H41 (SEQ ID NO: 6). 

Figure 7 shows a nucleic acid sequence of H63 (SEQ ID NO: 7). 
Figure 8 shows an nucleic acid sequence of H13 (SEQ ID NO: 8). 
Figure 9 shows an amino acid sequence of HI 3 (SEQ ID NO: 9). 
Figure 10 shows a nucleic acid sequence of H14 (SEQ ID NO: 10). 
30 Figure 11 shows an amino acid sequence of HI 4 (SEQ ID NO: 1 1). 

Figure 12 shows a Northern blot analysis of candidate gene expression. 
Differential expression patterns were confirmed by Northern blot analyses for 5 
C clones and 1 1 H clones. For clones H35 and H45, 20 ng of total RNA was 
loaded in each lane. For the remaining clones, 2 fig of poly (A) RNA was 
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loaded ( C = MCF-7 /control mRNA; H = MCF-7/HER-2 mRNA. ) Ethidium 
bromide staining of RNA gel is shown below autoradiograrns to illustrate equal 
loading and quality of RNA. The size of the differentially expressed transcript is 
indicated on the left. 

5 Figure 13 shows an in vitro transcription / translation of the proteins 

from the identified differentially expressed transcripts. The transcription-coupled 
translation reaction was performed using T3 RNA polymerase, rabbit reticulocyte 
lysate and [ 35 S] methionine labeling. The first lane represents the 61 kDa 
luciferase protein product which was used as a positive control. The C40, HI 3, 
10 H17, and H37 protein products are seen as distinct bands at 55, 30, 50, and 90 
kDa, respectively, whereas the H41 cDNA produced two faint bands at 30 kDa 
and a lower molecular weight. The protein molecular weight marker is shown on 
the left 

Figure 14 shows a schematic representation of the three differentially 

15 expressed novel genes. The thin line indicates a stretch of nucleotide sequences 
ending at poly A tail, denoted by (A) N , at the base pair number written next. The 
filled box illustrates location of the most probable open reading frame, with the 
numbers below indicating the base pair positions of start and stop codons 
respectively. A, Map of the C40 cDNA, the leucine zipper motif is denoted by 

20 LZ and shown above is the corresponding amino acid positions and sequences. 
B, Map of the HI 7 cDNA, the asterisks above poly A tail indicate the presence 
of polyadenylation signals. The hatched box illustrates the amino acid region of 
shared homology and the sequence alignments are shown below the gene. 
Identical residues are indicated by shading (Z77667 (SEQ ID NO: 12) = a C 

25 eUgans cDNA of unknown function, AE001086 (SEQ ID NO: 13) = sarcosine 
oxidase). Numbers at the right represent corresponding amino acid positions. 
Gaps introduced for maximal alignment are marked with dashes. C, Map of the 
H41 cDNA, NLS = nuclear localization signal, AF005858 (SEQ ID NO: 14) = 
one of the "fast evolving" drosophila genes of unknown function. 

30 Figure 15 shows the confirmation of differential expression in CaOv-3 

ovarian cancer cells overexpressing HER-2. The differential expression patterns 
of three C clones and nine H clones identified in MCF-7 breast cancer cells were 
reproduced in CaOv-3 ovarian cancer cell counterparts on Northern blot (C = 
Control, H = HER-2 trans fectant.) 20 jAg of total RNA was loaded in each lane. 



6 



WO 01/25250 



PCTAJS00/27649 



Ethidium bromide staining of 18 S ribosomal RNA is shown as a loading control 
below autoradiograms. The transcript sizes are as shown in Figure 1 2. 

Figure 16 shows that the upregulation of the H37 and H41 transcripts 
correlates with HER-2/neu overexpression in human breast tumors (p<0.005 and 
5 p<0.075 respectively). A, Northern blot analysis was performed to compare 
expression levels of the HER-2 vs. H37 cDNAs in 15 individual breast tumor 
samples. Ten jig of total RNA was loaded in each lane, and the same blot was 
stripped for rehybridization with the second probe. B, The expression levels of 
the HER-2 vs. H41 cDNAs were analyzed in a separate Northern blot 

10 experiment. The same set of breast tumor samples were used as in panel A 
except that the #16 tumor was substituted for the #14 due to depletion of the 
sample. Fifteen |ig of total RNA was loaded in each lane except for tumor #15 
for which only 5 jig were used because of lack of material. The blot was stripped 
as in A. For both A and B, ethidium bromide staining of 28 S ribosomal RNA is 

15 shown below the autoradiograms for RNA loading control 

DESCRIPTION OF THE INVENTION 

Unless otherwise defined, all terms of art, notations and other scientific 
terminology used herein are intended to have the meanings commonly 

20 understood by those of skill in the art to which this invention pertains. In some 
cases, terms with commonly understood meanings are defined herein for clarity 
and/or for ready reference, and the inclusion of such definitions herein should 
not necessarily be construed to represent a substantial difference over what is 
generally understood in the art The techniques and procedures described or 

25 referenced herein are generally well understood and commonly employed using 
conventional methodology by those skilled in the art, such as, for example, the 
widely utilized molecular cloning methodologies described in Sambrook et al, 
1989, Molecular Cloning: A Laboratory Manual, 2d ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. As appropriate, procedures 

30 involving the use of commercially available kits and reagents are generally carried 
out in accordance with manufacturer defined protocols and/ or parameters unless 
otherwise noted. 

In addition, a variety of art accepted definitions and methods for 
manipulating, evaluating and utilizing polypeptide and polynucleotide sequences 
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are well known in. the art and are widely used as a standard practice in the field of 
biotechnology. Such common terms and practices are provided, for example in 
U.S. Patent No. 5,922,566, which is incorporated herein by reference and which 
recites a variety of the common terms and methodologies illustrated below. 
5 Before the present proteins, nucleotide sequences, and methods are 

described, it is understood that this invention is not limited to the particular 
methodology, protocols, cell lines, vectors, and reagents described as these may 
vary. It is also to be understood that the terminology used herein is for the 
purpose of describing particular embodiments only and is not intended to Limit 
10 the scope of the present invention which will be limited only by the appended 
claims. 

It must be noted that as used herein and in the appended claims, the 
singular forms "a", "an", and "the" include plural reference unless the context 
clearly dictates otherwise. Thus, for example, reference to "a host celT includes a 
15 plurality of such host cells, reference to the "antibody 1 is a reference to one or 
more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meanings as commonly understood by one of ordinary skill in the 

20 art to which this invention belongs. Although any methods and materials similar 
or equivalent to those described herein can be used in the practice or testing of 
the present invention, the preferred methods, devices, and materials are now 
described. All publications mentioned herein are incorporated herein by reference 
for the purpose of describing and disclosing the cell lines, vectors, and 

25 methodologies which are reported in the publications which might be used in 
connection with the invention. Nothing herein is to be construed as an admission 
that the invention is not entitled to antedate such disclosure by virtue of prior 
invention. 

30 DEFINITIONS 

As used herein, the term "polynucleotide" means a polymeric form of 
nucleotides of at least about 10 bases or base pairs in length, either 
ribonucleotides or deoxynucleotides or a modified form of either type of 
nucleotide, and is meant to include single and double stranded forms of DNA. 
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As used herein, the term "polypeptide" means a polymer of at least about 6 
amino acids. Throughout the specification, standard three letter or single letter 
designations for amino acids are used. 

M Nucleic acid sequence", as used herein, refers to an oligonucleotide, 
5 nucleotide, or polynucleotide, and fragments or portions thereof, and to DNA or 
RNA of genomic or synthetic origin which may be single- or double-stranded, 
and represent the sense or antisense strand. Similarly, "amino acid sequence", as 
used, herein refers to an oligopeptide, peptide, polypeptide, or protein sequence, 
and fragments or portions thereof, and to naturally occurring or synthetic 
10 molecules. 

Where "amino acid sequence" is recited herein to refer to an amino acid 
sequence of a naturally occurring protein molecule, "amino acid sequence" and 
like terms, such as "polypeptide" or "protein" are not meant to limit the amino 
acid sequence to the complete, native amino acid sequence associated with the 

15 recited protein molecule. 

"Peptide nucleic acid", as used herein, refers to a molecule which 
comprises an oligomer to which an amino acid residue, such as lysine, and an 
amino group have been added. These small molecules, also designated anti-gene 
agents, stop transcript elongation by binding to their complementary strand of 

20 nucleic acid (Nielsen, P. E. et al. (1993) Anticancer Drug Des. 8:53-63). 

HOMPS, as used herein, refers to the amino acid sequences of substantially 
purified HOMPS obtained from any species, particularly mammalian, including 
bovine, ovine, porcine, murine, equine, and preferably human, from any source 
whether natural, synthetic, semi-synthetic, or recombinant 

25 "Consensus", as used herein, refers to a nucleic acid sequence which has been 
resequenced to resolve uncalled bases, or which has been extended using the XL- 
PCR kit (Perkin Elmer, Norwalk, Conn. ) in the 5 1 and/or the 3' direction and 
resequenced, or which has been assembled from the overlapping sequences of 
more than one clone using the GELVIEW Fragment Assembly system (GCG, 

30 Madison, Wis. ), or which has been both extended and assembled. 

A "variant" of HOMPS, as used herein, refers to an amino acid sequence 
that is altered by one or more amino acids. The variant may have "conservative" 
changes, wherein a substituted amino acid has similar structural or chemical 
properties, e. g. , replacement of leucine with isoleucine. More rarely, a variant 
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may have "nonconservative" changes, e. g. , replacement of a glycine with a 
tryptophan. Similar minor variations may also include amino acid deletions or 
insertions, or both. Guidance in deterniining which amino acid residues may be 
substituted, inserted, or deleted without abolishing biological or immunological 
5 activity may be found using computer programs well known in the art, for 
example, LASERGENE software. 

A "deletion' 1 , as used herein, refers to a change in either amino acid or 
nucleotide sequence in which one or more amino acid or nucleotide residues, 
respectively, are absent. 
10 An "insertion" or "addition", as used herein, refers to a change in an amino acid 
or nucleotide sequence resulting in the addition of one or more amino acid or 
nucleotide residues, respectively, as compared to the naturally occurring 
molecule. 

A "substitution", as used herein, refers to the replacement of one or more 
15 amino acids or nucleotides by different amino, acids or nucleotides, respectively. 

The term "biologically active", as used herein, refers to a protein having 
structural, regulatory, or biochemical functions of a naturally occurring molecule. 
Likewise, "immunologically active" refers to the capability of the natural, 
recombinant, or synthetic HOMPS, or any oligopeptide thereof, to induce a 
20 specific immune response in appropriate animals or cells and to bind with 
specific antibodies. 

The term "agonist", as used herein, refers to a molecule which, when 
bound to HOMPS, causes a change in HOMPS which modulates the activity of 
HOMPS. Agonists may include proteins, nucleic acids, carbohydrates, or any 
25 other molecules which bind to HOMPS. 

The terms "antagonist" or "inhibitor", as used herein, refer to a molecule 
which, when bound to HOMPS, blocks or modulates the biological or 
immunological activity of HOMPS. Antagonists and inhibitors may include 
proteins, nucleic acids, carbohydrates, or any other molecules which bind to 
30 HOMPS. 

The term "modulate", as used herein, refers to a change or an alteration 
in the biological activity of HOMPS. Modulation may be an increase or a 
decrease in protein activity, a change in binding characteristics, or any other 
change in the biological, functional or immunological properties of HOMPS. 
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The term "mimetic", as used herein, refers to a molecule, the structure of 
which is developed from knowledge of the structure of HOMPS or portions 
thereof and, as such, is able to effect some or all of the actions related to the 
human Her-2/ neu overexpression modulated proteins. 
5 The term "derivative", as used herein, refers to the chemical modification 

of a nucleic acid encoding HOMPS or the encoded HOMPS. Illustrative of such 
modifications would be replacement of hydrogen by an alkyl, acyl, or amino 
group, A nucleic acid derivative would encode a polypeptide which retains 
essential biological characteristics of the natural molecule. 

10 The term "substantially purified", as used herein, refers to nucleic or 

amino acid sequences that are removed from their natural environment, isolated 
or separated, and are at least 60% free, preferably 75% free, and most preferably 
90% free from other components with which they are naturally associated. 

"Amplification", as used herein, refers to the production of additional 

15 copies of a nucleic acid sequence and is generally carried out using polymerase 
chain reaction (PGR) technologies well known in the art (Dieffenbach, C. W. and 
G. S. Dveksler (1995) PCR Primer, a Laboratory Manual, Cold Spring Harbor 
Press, Plainview, N. Y. ). 

The term "hybridization", as used herein, refers to any process by which a 

20 strand of nucleic acid binds with a complementary strand through base pairing 
The term "hybridization complex", as used herein,, refers to a complex formed ' 
between two nucleic acid sequences by virtue of the formation of hydrogen 
bonds between complementary G and C bases and between complementary A 
and T bases; these hydrogen bonds may be further stabilized by base stacking 

25 interactions. 

The two complementary nucleic acid sequences hydrogen bond in an antiparallel 
configuration. A hybridization complex may be formed in solution (e. g. or R^ 
analysis) or between one nucleic acid sequence present in solution and another 
nucleic acid sequence immobilized on a solid support (e. g. , membranes filters, 
30 chips, pins or glass slides to which cells have been fixed for in situ hybridization). 

The terms "complementary" or "complementarity", as used herein, refer 
to the natural binding of polynucleotides under permissive salt and temperature 
conditions by base-pairing. For example, the sequence "A-G-T" binds to the 
complementary sequence "T-C-A". Complementarity between two single- 
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stranded molecules may be "partial", in which only some of the nucleic acids 
bind, or it may be complete when total complementarity exists between the single 
stranded molecules. The degree of complementarity between nucleic acid strands 
has significant effects on the efficiency and strength of hybridization between 
5 nucleic acid strands. This is of particular importance in amplification reactions, 
which depend upon binding between nucleic acids strands. 

The term "homology", as used herein, refers to a degree of 
complementarity. There may be partial homology or complete homology (i. e. , 
identity). A partially complementary sequence is one that at least partially inhibits 

10 an identical sequence from hybridizing to a target nucleic acid; it is referred to 
using the functional term "substantially homologous. " The inhibition of 
hybridization of the completely complementary sequence to the target sequence 
may be examined using a hybridization assay (Southern or northern blot, solution 
hybridization and the like) under conditions of low stringency. A substantially 

15 homologous sequence or probe will compete for and inhibit the binding (l e. , 
the hybridization) of a completely homologous sequence or probe to the target 
sequence under conditions of low stringency. This is not to say that conditions of 
low stringency are such that non-specific binding is permitted; low stringency 
conditions require that the binding of two sequences to one another be a specific 

20 (i. e. , selective) interaction. The absence of non-specific binding may be tested by 
the use of a second target sequence which lacks even a partial degree of 
complementarity (e. g. , less than about 30% identity); in the absence of non- 
specific binding, the probe will not hybridize to the second non-complementary 
target sequence. 

25 As used herein, the terms "hybridize", "hybridizing", "hybridizes" and 

the like, used in the context of polynucleotides, are meant to refer to 
conventional hybridization conditions, preferably such as hybridization in 50% 
formamide/6XSSC/0.1% SDS/100 Hg/ml ssDNA, in which temperatures for 
hybridization are above 37 degrees C and temperatures for washing in 0.1 X 

30 SSC/0.1% SDS are above 55 degrees C, and most preferably to stringent 
hybridization conditions. 

"Stringency" of hybridization reactions is readily determinable by one of 
ordinary skill in the art, and generally is an empirical calculation dependent upon 
probe length, washing temperature, and salt concentration. In general, longer 
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probes require higher temperatures for proper annealing, while shorter probes 
need lower temperatures. Hybridization generally depends on the ability of 
denatured DNA to reanneal when complementary strands are present in an 
environment below their melting temperature. The higher the degree of desired 
5 homology between the probe and hybridizable sequence, the higher the relative 
temperature that can be used. As a result, it follows that higher relative 
temperatures would tend to make the reaction conditions more stringent, while 
lower temperatures less so. For additional details and explanation of stringency of 
hybridization reactions, see Ausubel et al, Current Protocols in Molecular 

10 Biology, Wiley Interscience Publishers, (1995). 

"Stringent conditions" or "high stringency conditions", as defined herein, 
may be identified by those that: (1) employ low ionic strength and high 
temperature for washing, for example. 0.015 M sodium chloride/0,0015 M 
sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during 

15 hybridization a denaturing agent, such as formamide, for example, 50% (v/v) 
formamide with 0.1% bovine serum aibumin/0.1% Ficoll/0.1% 
polyvinylpyrrolidone/ 50mM sodium phosphate buffer at pH 6.5 with 750 mM 
sodium chloride, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 
x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 

20 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon 
sperm DNA (50 ^g/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, with 
w.ishes at 42°C in 0.2 x SSC (sodium chloride/sodium, citrate) and 50% 
formamide at 55°C, followed by a high-stringency wash consisting of 0.1 x SSC 
containing EDTA at 55°C. 

25 "Moderately stringent conditions" may be identified as described by 

Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, New York: 
Cold Spring Harbor Press, and include the use of washing solution and 
hybridization conditions (e.g., temperature, ionic strength and %SDS) less 
stringent than those described above. An example of moderately stringent 

30 conditions is overnight incubation at 37°C in a solution comprising: 20% 
formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium 
phosphate (pH 7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 
mg/mL denatured sheared salmon sperm DNA, followed by washing the filters 
in 1 x SSC at about 37-50°C. The skilled artisan will recognize how to adjust the 
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temperature, ionic strength, etc. as necessary to accommodate factors such as 
probe length and the like. 

In the context of amino acid sequence comparisons, the term "identity" is 
used to express the percentage of amino acid residues at the same relative 
5 positions that are the same. Also in this context, the term "homology" is used to 
express the percentage of amino acid residues at the same relative positions that 
are either identical or are similar, using the conserved amino acid criteria of 
BLAST analysis, as is generally understood in the art. For example, % identity 
values may be generated by WU-BLAST-2 (Altschul et al., 1996, Methods in 

10 Enzymology 266:460-480; http.7/blastwusd/edu/blast/README.html). 
Further details regarding amino acid substitutions, which are considered 
conservative under such criteria, are provided below. 

As will be understood by those of skill in the art, the stringency of 
hybridization may be altered in order to identify or detect identical or related 

15 polynucleotide sequences. 

The term "antisense", as used herein, refers to nucleotide sequences 
which are complementary to a specific DNA or RNA sequence. The term 
"antisense strand" is used in reference to a nucleic acid strand that is 
complementary to the "sense" strand. Antisense molecules may be produced by 

20 any method, including synthesis by ligating the gene(s) of interest in a reverse 
orientation to a viral promoter which permits the synthesis of a complementary 
strand. Once introduced into a cell, this transcribed strand combines with natural 
sequences produced by the cell to form duplexes. These duplexes then block 
either the further transcription or translation. In this manner, mutant phenotypes 

25 may be generated. The designation "negative" is sometimes used in reference to 
the antisense strand, and "positive" is sometimes used in reference to the sense 
strand. 

The term "portion", as used herein, with regard to a protein (as in "a 
portion of a given protein") refers to fragments of that protein. The fragments 
30 may range in size from four amino acid residues to the entire amino acid 
sequence minus one amino acid. Thus, a protein "comprising at least a portion of 
the amino acid sequence of Figure 2, Figure 4, Figure 6, Figure 9 or Figure 11" 
encompasses the full-length human HOMPS and fragments thereof. 



14 



WO 01/25250 



PCT/US00/27649 



"Transformation", as defined herein, describes a process by which 
exogenous DNA enters and changes a recipient cell It may occur under natural 
or artificial conditions using various methods well known in the art 
Transformation may rely on any known method for the insertion of foreign 
5 nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method is 
selected based on the host cell being transformed and may include, but is not 
limited to, viral infection, electroporation, lipofection, and particle bombardment. 
Such "transformed" cells include stably transformed ceUs in which the inserted 
DNA is capable of replication either as an autonomously replicating plasmid or 

10 as part of the host chromosome. They also include cells which transiendy express 
the inserted DNA or RNA for limited periods of time. 

The term "antigenic determinant", as used herein, refers to that portion 
of a molecule that makes contact with a particular antibody (i. e. an epitope). 
When a protein or fragment of a protein is used to immunize a host animal, 

15 numerous regions of the protein may induce the production of antibodies which 
bind specifically to a given region or three-dimensional structure on the protein; 
these regions or structures are referred to as antigenic determinants. An antigenic 
determinant may compete with the intact antigen (i. e. , the immunogen used to 
elicit the immune response) for binding to an antibody. 

20 The terms "specific binding" or "specifically binding", as used herein, in 

reference to the interaction of an antibody and a protein or peptide, mean that 
the interaction is dependent upon the presence of a particular structure (i. e., the 
antigenic determinant or epitope) on the protein; in other words, the antibody is 
recognizing and binding to a specific protein structure rather than to proteins in 

25 general For example, if an antibody is specific for epitope "A", the presence of a 
protein containing epitope A (or free, unlabeled A) in a reaction containing 
labeled "A" and the antibody will reduce the amount of labeled A bound to the 
antibody. 

The term "sample", as used herein, is used in its broadest sense. A 
30 biological sample suspected of containing nucleic acid encoding HOMPS or 
fragments thereof may comprise a cell, chromosomes isolated from a cell (e. g. , a 
spread of rnetaphase chromosomes), genomic DNA (in solution or bound to a 
solid support such as for Southern analysis), RNA (in solution or bound to a 
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solid support such as for northern analysis), cDNA (in solution or bound to a 
solid support), an extract from cells or a tissue, and the like. 

The term "correlates with expression of a polynucleotide", as used herein, 
indicates that the detection of the presence of ribonucleic acid that is similar to 
5 Figure 1, Figure 3, Figure 5, Figure 8 or Figure 10 by northern analysis is 
indicative of the presence of mRNA encoding HOMPS in a sample and thereby 
correlates with expression of the transcript from the polynucleotide encoding the 
protein. 

"Alterations" in the polynucleotide of Figure 1, Figure 3, Figure 5, Figure 8 or 
10 Figure 10, as used herein, comprise any alteration in the sequence of 
polynucleotides encoding HOMPS including deletions, insertions, and point 
mutations that may be detected using hybridization assays. Included within this 
definition is the detection of alterations to the genomic DNA sequence which 
encodes HOMPS (e. g. , by alterations in the pattern of restriction fragment 
15 length polymorphisms capable of hybridizing to Figure 1, Figure 3, Figure 5, 
Figure 8 or Figure 10), the inability of a selected fragment of Figure 1, Figure 3, 
Figure 5, Figure 8 or Figure 10 to hybridize to a sample of genomic DNA (e. g. , 
using allele-specific oligonucleotide probes), and improper or unexpected 
hybridization, such as hybridization to a locus other than the normal 
20 chromosomal locus for the polynucleotide sequence encoding HOMPS (e. g. , 
using fluorescent in situ hybridization [FISH] to metaphase chromosomes 
spreads). 

As used herein, the term "antibody" refers to intact molecules as well as 
fragments thereof, such as Fab, F(ab') 2 and Fv, which are capable of binding the 

25 epitopic determinant. Antibodies that bind HOMPS polypeptides can be 
prepared using intact polypeptides or fragments containing small peptides of 
interest as the immunizing antigen. The polypeptide or peptide used to immunize 
an animal can be derived from the transition of RNA or synthesized chemically, 
and can be conjugated to a carrier protein, if desired; Commonly used carriers 

30 that are chemically coupled to peptides include bovine serum albumin and 
thyroglobulin. The coupled peptide is then used to immunize the animal (e. g. , a 
mouse, a rat, or a rabbit). 

The term "humanized antibody", as used herein, refers to antibody 
molecules in which arnino acids have been replaced in the non-antigen binding 
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regions in order to more closely resemble a human antibody, while still retaining 
the original binding ability. 

Additional definitions are provided throughout the subsections that 

follow, 

5 

THE INVENTION 

The invention is based on the discovery of new human Her-2/neu 
overexpression modulated proteins (HOMPS), the polynucleotides encoding 
HOMPS, and the use of these compositions for the evaluation and 

10 characterization of disorders associated with abnormal cellular proliferation. 

Amplification and resulting overexpression of the HER-2/neu proto- 
oncogene is found in approximately 30% of human breast and 20% of human 
ovarian cancers. To better understand the molecular events associated with 
overexpression of this gene in human breast cancer cells, differential 

15 hybridization was used to identify genes whose expression levels are altered in 
cells overexpressing this receptor. As illustrated below, of 1 6,000 clones screened 
from an overexpression cell cDNA library, a total of 19 non-redundant clones, 
were isolated including 7 whose, expression decreases (C clones) and 12 which 
increase (H clones) in association with HER-2/ neu overexpression. Of these, 5 C 

20 clones and 11 H clones have been confirmed to be differentially expressed by 
Northern blot analysis. This group includes nine genes of known function, three 
previously sequenced genes of relatively uncharacterized function and four novel 
genes without match in GenBank. Examination of the previously characterized 
genes indicates that they represent sequences known to be frequently associated 

25 with the malignant phenotype, suggesting that the subtraction cloning strategy 
used identified appropriate target genes. In addition, differential expression of 12 
of 16 (75%) cDNAs identified in the breast cancer cell lines are also seen in 
HER-2/w/* overexpressing ovarian cancer cells, indicating that they represent 
generic associations with HER-2/ww overexpression. Finally, upregulation of 

30 two of the identified cDNAs, one novel and one identified but as yet 
uncharacterized gene, was confirmed in human breast cancer specimens in 
association with HYLK-2/neu overexpression. Further characterization of these 
genes may yield insight into the fundamental biology and pathogenetic effects of 
HER-2/ neu overexpression in human breast and ovarian cancer cells. 
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To more directly evaluate the potential biologic role of HER-2/neu 
overexpression in the pathogenesis of breast cancer, disclosed herein is an 
experimental model in which the effects of overexpression in human breast 
cancer cells can be studied (Pietras, R. J., Fendly, B. M., Chazin, V. R., Pegram, 
5 M. D., Howell, S. B. and Slamon, D. J. (1994) Oncogene, 9(7), 1829-38; Pietras, R. 
J., Arboleda, J., Reese, D. M., Wongvipat, N., Pegram, M. D., Ramos, L., 
Gorman, C. M., Parker, M. G, Sliwkowski, M. X. and Slamon, D. J. (1995) 
Oncogene, 10(12), 2435-46; Pegram, M. D., Finn, R. S., Arzoo, K., Beryt, M., 
Pietras, R. J. and Slamon, D. J. (1997) Oncogene, 15(5), 537-47; Chazin, V.R. 

10 (1991). The biologic effects of HER-2//hw proto-oncogene overexpression, 
Chapter 2. Department of Microbiology and Immunology, University of 
California, Los Angeles). In these experiments, human MCF-7 breast cancer cells 
which express normal levels of the receptor were transfected with a retroviral 
expression vector containing a full length cDNA encoding the human HER-2 

15 gene (Chazin, V. R., Kaleko, M., Miller, A. D. and Slamon, D. J. (1992) Oncogene, 
7(9), 1859-66). Multiple rounds of infection and sorting of the top 5% of HER-2 
overexpressing, pooled transfectants generated a stably transfected cell line, 
MCF-7/HER-2, in which HER-2 expression levels were comparable to those 
observed in human HER-2 overexpressing breast cancer specimens (Pegram, M. 

20 D., Finn, R. S., Arzoo, K., Beryt, M., Pietras, R. J. and Slamon, D. J. (1997) 
Oncogene, 15(5), 537-47). MCF-7 cells were similarly transfected using an empty 
neomycin resistance vector including multiple infections, producing the MCF- 
7 /control cell line. The amount of HER-2//W protein expressed, as determined 
by quantitative Western blot analysis, was approximately 1 .62 pg/cell for MCF- 

25 7/HER-2 cells as compared to 0.36 pg/cell for MCF-7/control cells (Press, M. 
F., Pike, M. C, Chazin, V. R., Hung, G., Udove, J. A., Markowicz, M., Danyluk, 
]., Godolphin, W., Sliwkowski, M., Altita, R. and et al. (1993) Cancer Rts, 53(20), 
4960-70). In vitro and in vivo studies of these engineered cells demonstrated that 
the growth characteristics of the MCF-7/HER-2 human breast cancer cell line 

30 are significantiy altered by the overexpression of HEK-2/neu (Pietras, R. J., 
Fendly, B. M., Chazin, V. R., Pegram, M. D., Howell, S. B. and Slamon, D. J. 
(1994) Oncogene, 9(7), 1829-38; Pietras, R. J., Arboleda, J., Reese, D. M., 
Wongvipat, N., Pegram, M. D., Ramos, L., Gorman, C. M., Parker, M. G., 
Sliwkowski, M. X. and Slamon, D. J. (1995) Oncogene, 10(12), 2435-46; Pegram, M. 
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D., Finn, R. S., Arzoo, K., Beryt, M, Pietras, R. J. and Slamon, D. J. (1997) 
Oncogene, 15(5), 537-47; Chazin, V.R. (1991). The biologic effects of HER-2/**// 
proto-oncogene overexpression, Chapter 2. Department of Microbiology and 
Immunology, University of California, Los Angeles; Chazin, V. R., Kaleko, M., 
5 Miller, A. D. and Slamon, D. J. (1992) Oncogene, 7(9), 1859-66). Increased cell 
proliferation was seen in the HER-2 overexpressing cell line as assessed by ^H- 
thymidine incorporation and in vitro cell proliferation assays. In addition, HER-2 
overexpression markedly improved soft agar cloning efficiency, and the cells 
exhibited increased tumorigenicity in nude mice ( Chazin, V.R. (1991). The 

10 biologic effects of HER-2///«y proto-oncogene overexpression, Chapter 2. 
Department of Microbiology and Immunology, University of California, Los 
Angeles; Chazin, V. R., Kaleko, M., Miller, A. D. and Slamon, D. J. (1992) 
Oncogene, 7(9), 1859-66). Together, the data confirmed that overexpression of the 
HER-2 receptor tyrosine kinase plays a role in altering the biologic behavior of 

15 human breast cancer cells. The exact molecular mechanism(s) by which this 
overexpression promotes a more aggressive phenotype of these cells, however, 
remains unknown. There are multiple potential mechanisms by which the 
observed phenotypic changes may occur. Increased amounts and/or activation 
of this cell surface receptor may affect either the expression or function of other 

20 molecules involved in regulation of cell proliferation. Direct effects of HER-2 
overexpression on other cellular proteins can be accomplished by changes in 1) 
expression at the mRNA transcript level,. 2) protein production at the 
translational level, or 3) protein activation / modification at the post-translational 
level. The cellular changes associated with HER-2/ neu overexpression are likely 

25 to be induced by most or all of these mechanisms. To identify those changes 
associated with differential expression of genes at the transcript level, we 
undertook a differential screening analysis. 

The subtraction cloning technique termed differential hybridization, also 
known as plus/minus screening (St John, T. P. and Davis, R. W. (1979) Ceil, 

30 16(2), 443-52), was used to isolate genes which are differentially expressed in 
MCF-7/HER-2 cells as compared to MCF-7/control cells. This approach has 
the advantage of comparing two human breast cancer cell lines which are 
identical except for HER-2/ neu overexpression allowing for a direct comparison 
of cDNAs derived from the two cell populations. In the current study, we 
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identified a series of genes, either previously characterized or entirely novel, 
whose expression levels are altered in association with HER-2/*w 
overexpression. It is possible that some of these genes might be mediators of the 
HER-2 overexpressing phenotype since we have confirmed their differential 
5 expression not only in human ovarian cancer cell lines which overexpress HER-2 
but also primary breast cancer specimens containing this alteration. 

The differential screening approach compared MCF-7 breast cancer cell 
lines transfected with a human HER-2/ w# cDNA (MCF-7/HER-2) or with an 
identical empty vector (MCF-7/control). The alternative approach of comparing 
10 two different non-engineered cell lines which are not isogenic i.e. MCF-7 and/or 
MDA-MB-231 compared against SKBR3 and/or BT-474, respectively, is 
problematic in that the presence of non-HER-2 associated genetic differences 
unique to cells derived from different individuals would almost certainly 
complicate interpretation of results. Such heterogenetic effects would confound 
15 identification of those genes which are differentially expressed in direct 
association with ¥LEK-2/neu overexpression. A relatively conventional 
subtraction cloning method termed differential hybridization has been 
successfully used by other investigators in the cloning of genes associated with 
various biologic phenomenon including the galactose-inducible genes of yeast (St 
20 John, T. R and Davis, R. W. (1979) Cell, 16(2), 443-52), human fibroblast 
interferon (Taniguchi, T., Fujii-Kuriyama, Y. and Muramatsu, M. (1980) Proc Nail 
Acad Sci USA, 77(7), 4003-6), a variety of heat-shock proteins (Mason, I. J., 
Taylor, A., Williams, J. G., Sage, H. and Hogan, B. L. (1986) Embo /, 5(7), 1465- 
72), and the metastasis suppressor gene nm-23 (Steeg, P. S., Bevilacqua, G., 
25 Kopper, L., Thorgeirsson, U. P., Talmadge, J. E., Liotta, L. A. and Sober, M. E. 
(1988) ] Natl Cancer Inst, 80(3), 200-4). This screening strategy has the advantage 
of obtaining a high yield of full-length clones in contrast to more recent 
techniques such as differential display or representational difference analysis 
(RDA) which require an additional procedure of screening a cDNA library using 
30 the DNA fragments obtained. 

From our initial screen of 16,000 MCF-7/HER-2 cDNA library clones, 
we identified five genes with decreased and eleven genes with increased 
expression levels in association with HER-2///<w overexpression. These clones 
include nine genes with previously identified cellular functions, three existing 
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sequences of relatively uncharacterized function, and four novel genes without 
matching sequences in GenBank. A number of the known genes identified in 
our screening have been previously reported to be associated with several aspects 
of human breast cancer and/or tumorigenicity in general Although the 
5 differential screening approach does not provide direct evidence that a given gene 
plays a critical role in the phenotypic changes associated with HER-2 
overexpression, a review of the literature regarding some of the genes in our 
study indicates that they may be candidates. Recent data, for example, indicates 
that downregulation of cytokeratin (C29, C49) gene expression may result in 
10 disorganization of the cytoskeleton leading to enhanced invasive properties 
(Mukhopadhyay, T. and Roth, J. A. (1996) Anticancer Res, 16(1), 105-12). 
Similarly, the gamma actin (C72) transcript level is markedly decreased in salivary 
gland adenocarcinoma cells on acquisition of metastatic ability (Suzuki, H., 
Nagata, H., Shimada, Y. and Konno, A. (1998) Int J Oncol, 12(5), 1079-84). These 
15 observations are consistent with our findings and are of interest given the fact 
that HER-2 overexpression is associated with increased metastatic potential 
(Kennedy, M. J. (1996) Curr Opin Oncol, 8(6), 485-90; Pantel, K., Schlimok, G., 
Braun, S., Kutter, D., Lindemann, F., Schaller, G., Funke, I., Izbicki, J. R. and 
Riethmuller, G. (1993) / Natl Cancer Inst, 85(17), 1419-24; KaUioniemi, O. P., 
20 Hoili, K., Visakorpi, T, Koivula, T., Helin, H. H. and Isola, J. J. (1991) Int J 
Cancer, 49(5), 650-5). The observation that Cathepsin D (C31) transcript level is 
decreased in HER-2 overexpressing breast cancer cells is consistent with the 
most recent clinical data (Johnson, M. D., Torri, J. A., Lippman, M. E. and 
Dickson, R. B. (1993) Cancer Res, 53(4), 873-7; Ravdin, P. M., Tandon, A. K., 
25 AUred, D. C, Clark, G. M., Fuqua, S. A., Hilsenbeck, S. H., Chamness, G. C. and 
Osborne, C. K. (1994) / Clin Oncol, 12(3), 467-74) which contradict the original 
reports of high Cathepsin D concentrations as indicative of a poorer prognosis 
(Thorpe, S. M., Rochefort, H., Garcia, M., Freiss, G., Christensen, I. J., Khalaf, 
S., Paolucci, F., Pau, B., Rasmussen, B. B. and Rose, C. (1989) Cancer Res, 49(21), 
30 6008-14). The 90 kDa heat shock protein (HI 8) forms highly stable complexes 
with the estrogen receptor and thus may play a role in mediating estrogen- 
dependent growth (Ramachandran, C, Catelli, M. G., Schneider, W. and 
Shyamala, G. (1988) Endocrinology, 123(2), 956-61; Shyamala, G., Gauthier, Y., 
Moore, S. K., Catelli, M. G. and Ullrich, S. J. (1989) Mol Cell Biol, 9(8), 3567-70). 
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Its potential role in regulating estrogen receptor activity in human breast cancer is 
interesting in light of the interactions recently described between HER-2 and the 
estrogen receptor (Carlomagno, C, Perrone, F., Gallo, G, De Laurentiis, M., 
Lauria, R., Morabito, A., Pettinato, G., Panico, L., D'Antonio, A., Bianco, A. R. 
5 and De Placido, S. (1996) / Clin Oncol, 14(10), 2702-8; Ignar-Trowbridge, D. M., 
Nelson, K. G., Bidwell, M. C, Curtis, S. W., Washburn, T. F., McLachlan, J. A. 
and Korach, K. S. (1992) Prvc Natl Acad Sci USA, 89(10), 4658-62). Other 
known genes found in our screening to be overexpressed in association with 
HER-2 overexpression, ribosomal proteins L8 (HI 6) and LLrep3(H35), GAPDH 

10 (H31), and succinyl coA transferase (H45), may be merely reflective of higher 
proliferation in HER-2 overexpressing tumors. Alternatively, differential 
expression of these genes may be more specifically linked to HER-2 
overexpression. An example of this could be the LLrep3 which was also 
identified in differential hybridization screening of a ras -trans fected 

15 teratocarcinoma cell line compared to isogenic cell control as increased 25-fold 
(Chiao, P. J, Shin, D. M, Sacks, P. G., Hong, W. K. and Tainsky, M. A. (1992) 
Mol Carcinog, 5(3), 219-31), however, this gene is not differentially expressed when 
comparing nontumorigenic and tumorigenic NIH 3T3 cells transformed by Ha- 
ras, N-ras, v-myc, v-mos, v-src and v-abl (Chiao, P. J., Shin, D. M., Sacks, P. G., 

20 Hong, W. K. and Tainsky, M. A. (1992) MolCarcinog, 5(3), 219-31). 

DNA fragmentation factor (DFF) (HI 3) is also overexpressed in the 
HER-2 overexpressing cells and has recently been identified as a protein which 
functions downstream of caspase-3 during apoptosis (Liu, X., Zou, H., Slaughter, 
C. and Wang, X. (1997) Cell, 89(2), 175-84). Its exact cellular role in this process, 

25 i.e. inhibition or promotion of apoptosis, however, is as yet undefined (Mitamura, 
S., Ikawa, H., Mizuno, N., Kaziro, Y. and Itoh, H. (1998) Biochem Biophys Res 
Commun, 243(2), 480-4; Inohara, N., Koseki, T., Chen, S., Wu, X. and Nunez, G. 
(1998) Embo J, 17(9), 2526-33; Granville, D. J., Jiang, H, An, M. T., Levy, J. G., 
McManus, B. M. and Hunt, D. W. (1998) FEBS Lett, 422(2), 151-4). Lastly, 

30 DRP-1 (Density regulated protein-1) (H14), which is also increased in association 
with HER-2 overexpression has been found to be preferentially expressed in cells 
grown at high density compared to cells at low density. Growth arrest by serum 
starvation or transforming growth factor B treatment does not however induce 
this gene's expression (Deyo, J, E., Chiao, P. J. and Tainsky, M. A. (1998) DNA 
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Cell Biol y 17(5), 437-47). Its role, if any, in the HER-2 phenotype remains to be 
determined. 

The possibility that the pattern of differential gene expression observed in 
this study is unique to a given experimental cell line rather than a generic 
5 phenomenon associated with HER-2 overexpression was also addressed. To 
verify differential expression in another cell line, we utilized CaOv-3 ovarian 
cancer cells engineered to overexpress HER-2/W. For 75% of the differentially 
expressed clones, the patterns identified in the breast cancer cells were also found 
in the human ovarian cancer cell counterparts. This consistent expression 

10 pattern, demonstrated across cell lines from two different epithelia (i.e. breast and 
ovary), suggest that the expression differences observed in our study are related 
to HER-2///W overexpression. In addition, we found a correlation between 
overexpression of HER-2/w# and upregulation of the H37 and H41 genes in 
actual human breast cancer specimens. Those genes which did not yield a signal 

15 on Northern analysis likely due to rare message level are currendy being evaluated 
by a quantitative RT-PCR approach to circumvent this difficulty. Given the 
problem in assessing Northern blot analyses from whole tissue specimens 
resulting from dilutional artifacts introduced by surrounding normal cells, these 
correlations are encouraging. It is intriguing that the H37 cDNA, found to be 

20 overexpressed in HER-2 overexpressing cells in the current study and 
demonstrating convincing differential expression in actual tumor samples, is 
localized to a region of chromosome 3p21.3 alleged to contain a putative lung 
cancer tumor suppressor gene(s) (Wei, M. H., Latif, F., Bader, S., Kashuba, V., 
Chen, J. Y., Duh, F. M, Sekido, Y., Lee, C. C, Geil, L., Kuzmin, I., Zabarovsky, 

25 E., Klein, G., Zbar, B., Minna, J. D. and Lerman, M. I. (1996) Camer as, 56(7), 
1487-92). Further characterization of this gene at the functional and genomic 
levels should give further insight into this phenomenon. 

The current studies indicate that HER-2/ neu overexpression induces a 
pattern of consistent genetic alterations in target human cells. We recognize that 

30 there are more sensitive techniques such as microarray chip technology now 
available for evaluating differential gene expression and plan to reanalyze these 
cell line pairs using these newer approaches. It is possible that some of the genes 
identified may in part be biologic mediators of the aggressive biologic behavior 
associated with HER-2///<w overexpression. Future elucidation of role of these 
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genes, in particular those with as yet unknown function, in mediating malignant 
phenotype should provide further insights into the fundamental biology and 
pathogenetic effects of HER-2/ w# over expression in human breast and ovarian 
cancer cells and suggests novel treatment strategies for patients whose tumors 
5 contain these alterations. 

USES FOR HOMPS GENES AND GENE PRODUCTS DESCRIBED 
HEREIN 

Skilled artisans understand both the great diagnostic value that known 

10 oncogenesis associated markers such as Her-2 and PSA provide in the 
monitoring of cancers in patients as well as the need for the identification of 
additional oncogenesis associated markers (see e.g. Bostwick et al., J Cell 
Biochem Suppl 1996;25:156-64 and Morote et al., Int J Cancer 1999 Aug 
20;84(4):421-5). In particular, artisans understand that oncogenesis is a multistep 

15 process and the identification of a variety of different oncogenesis associated 
markers can be used to identify and characterize precancerous and cancerous 
syndromes earlier and more efficiently (see e.g. Rhim et al., Cancer Res 1990 Sep 
1;50(17 Suppl):5653S-5657 ). In this context, the specific properties of the 
HOMPS proteins described herein (e.g. their modulation by a Her-2, an 

20 oncogene which plays a role significant role in a number of human cancers 
including breast cancer) includes them in the class of oncogenesis associated 
markers that can be used to evaluate and/or evaluate oncogenetic processes in 
cancers. Understandably, a number of the HOMPS proteins disclosed herein 
have been independently identified by other artisans as oncogenesis associated 

25 markers which can be used to examine growth disregulation in conditions such as 
cancer (see e.g. Yano et al., Jpn J Cancer Res 1996 Sep;87(9):908-15 [hsp90]; 
Chou et al., Proc Nad Acad Sci U S A 1987 May;84(9):2575-9 [gamma actin] and 
Tonkin et al., Cancer Prev Control 1999 Apr;3(2):131-6 [cathepsin-D]). 

As disclosed herein, HOMPS gene products exhibit specific properties 

30 that are analogous to those found in a family of genes whose polynucleotides, 
polypeptides and anti-polypeptide antibodies are used in well known diagnostic 
assays directed to examining conditions associated with deregulated cell growth 
such as cancer. Well known members of this class include Her-2 as well as PSA, 
the archetypal markers that have been used by medical practitioners for years to 
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identify and monitor the presence of cancers such as prostate cancer (see e.g. 
Merrill et al, J. Urol. 163(2): 503-5120 (2000); Polascik et al., J. Urol. 
Aug;162(2):293-306 (1999) and Fortier et al., J. Nat. Cancer Inst. 91(19): 1635- 
1640(1999)). A variety of other diagnostic markers are also used in this context 
5 including p53 and K-ras (see e.g. Tulchinsky et al., Int J Mol Med 1999 
Jul;4(l):99-102 and Minimoto et al, Cancer Detect Prev 2000;24(1):1-12). 
Consequendy, this disclosure of the HOMPS polynucleotides and polypeptides 
(as well as the HOMPS polynucleotide probes and anti-HOMPS antibodies used 
to identify the presence of these molecules) and their properties allows skilled 

10 artisans to utilize these molecules in methods that are analogous to those used, 
for example, in a variety of diagnostic assays directed to examining conditions 
associated with cancer. 

Typical embodiments of diagnostic methods which utilize the HOMPS 
polynucleotides, polypeptides and antibodies described herein are analogous to 

15 those methods from well established diagnostic assays which employ Her-2 and 
PSA polynucleotides, polypeptides and antibodies. For example, just as PSA 
polynucleotides are used as probes (for example in Northern analysis, see e.g. 
Sharief et al, Biochem. Mol. Biol. Int. 3 3 (3): 5 67-74(1 994)) and primers (for 
example in PCR analysis, see e.g. Okegawa et al., J. Urol. 163(4): 1189-1190 

20 (2000)) to observe the presence and/or the level of PSA mRNAs in methods of 
monitoring PSA over expression or the metastasis of prostate cancers, the 
HOMPS polynucleotides described herein can be utilized in the same way to 
evaluate or monitor the cellular growth disregulation that is associated with 
cancer. Alternatively, just as PSA polypeptides are used to generate antibodies 

25 specific for PSA which can then be used to observe the presence and/or the level 
of PSA proteins in methods of monitoring PSA protein overexpression (see e.g. 
Stephan et al, Urology 55(4):560-3 (2000)) or the metastasis of prostate cells (see 
e.g. Alanen et al., Pathol. Res. Pract. 192(3):233-7 (1996)), the HOMPS 
polypeptides described herein can be utilized to generate antibodies for use in 

30 detecting HOMPS overexpression as seen in cells expected of exhibiting some 
form of growth disregulation. 

Just as Her-2 and PSA polynucleotide fragments and polynucleotide 
variants are employed by skilled artisans for use in methods of monitoring this 
molecule, HOMPS polynucleotide fragments and polynucleotide variants can also 
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be used in an analogous manner. In particular, typical PSA polynucleotides used 
in methods of monitoring this molecule are probes or primers which consist of 
fragments of the PSA cDNA sequence. Illustrating this, primers used to PCR 
amplify a PSA polynucleotide must include less than the whole PSA sequence to 
5 function in the polymerase chain reaction. In the context of such PCR reactions, 
skilled artisans generally create a variety of different polynucleotide fragments 
that can be used as primers in order to amplify different portions of a 
polynucleotide of interest or to optimize amplification reactions (see e.g. 
Caetano-Anolles, G. Bio techniques 25(3): 472-476, 478-480 (1998); Robertson et 

10 al., Methods Mol. Biol. 98:121-154 (1998)). In addition, in order to facilitate their 
use by medical practitioners, variant polynucleotide sequences are typically used 
as primers and probes for the corresponding mRNAs in PCR and Northern 
analyses (see e.g. Sawai et al., Fetal Diagn. Ther. 1996 Nov-Dec;l 1(6):407-13 and 
Current Protocols In Molecular Biology, Volume 2, Unit 2, Frederick M. Ausubul 

15 et al. eds., 1995)). Polynucleotide fragments and variants are typically useful in 
this context as long as they have the common attribute or characteristic of being 
capable of binding to a target polynucleotide sequence (e.g. the HOMPS 
polynucleotide shown in Figure 2) under conditions of high stringency. 

Just as Her-2 and PSA polypeptide fragments and polypeptide variants 

20 are employed by skilled artisans for use in methods of monitoring this molecule, 
HOMPS polypeptide fragments and polypeptide variants can also be used in an 
analogous manner. In particular, typical PSA polypeptides used in methods of 
monitoring this molecule are fragments of the PSA protein which contain an 
epitope that can be recognized by an antibody which will specifically bind to the 

25 PSA protein. This practice of using polypeptide fragments or polypeptide 
variants used to generate antibodies (such as anti-PSA antibodies) is typical in the 
art with a wide variety of systems such as fusion proteins being used by 
practitioners (see e.g. Current Protocols In Molecular Biology, Volume 2, Unit 
16, Frederick M. Ausubul et al. eds., 1995). In this context, each of the variety of 

30 epitopes in a protein of interest functions to provide the architecture upon which 
the antibody is generated. Typically, skilled artisans generally create a variety of 
different polypeptide fragments that can be used in order to generate antibodies 
specific for different portions of a polypeptide of interest (see e.g. U.S. Patent 
No. 5,840,501 and U.S. Patent No. 5,939,533). For example it may be preferable 
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to utilize a polypeptide comprising one of the HOMPS biological motifs 
discussed below. Polypeptide fragments and variants are typically useful in this 
context as long as they have the common attribute or characteristic of having an 
epitope capable of generating an antibody specific for a target polypeptide 
5 sequence (e.g. the HOMPS polypeptide shown in Figure 2). 

As shown herein, the HOMPS polynucleotides and polypeptides (as well 
as the HOMPS polynucleotide probes and anti-HOMPS antibodies used to 
identify the presence of these molecules) exhibit specific properties that can make 
them useful in examining cancerous cells or tissues. The described diagnostic 

10 assays that measures the presence of HOMPS gene products, in order to provide 
evidence of growth disregulation are particularly useful in identifying potential 
candidates for preventive measures or further monitoring, as has been done so 
successfully with Her~2 and PSA (see e.g. Scheurle et al., Anticancer Res. 2000 
May-Jun;20(3B):2091-6; Fontana et al, Anticancer Res 1994 Sep- 

15 Oct;14(5B):2099-104 and Sahin, Adv Anat Pathol 2000 May;7(3): 158-66). 

HOMPS EMBODIMENTS 

As disclosed herein, the invention is directed to Her-2/neu 
overexpression modulated proteins (HOMPS) genes and HOMPS gene products 

20 as well as HOMPS antibodies and assays for detecting these molecules. 
Typically, the invention encompasses HOMPS proteins as well as the 
polynucleotides which encode HOMPS proteins. Accordingly, any nucleic acid 
sequence which encodes the amino acid sequence of HOMPS can be used to 
generate recombinant molecules which express HOMPS. In a particular 

25 embodiment, the invention encompasses the polynucleotide "comprising the 
nucleic acid sequence of Figure 1, Figure 3, Figure 5, Figure 8 or Figure 10. The 
invention encompasses HOMPS variants which retain biological or other 
functional activity of HOMPS. A preferred HOMPS variant is one having at least 
80%, and more preferably 90%, amino acid sequence identity to the HOMPS 

30 amino acid sequence of Figure 2, Figure 4, Figure 6, Figure 9 or Figure 11. A 
most preferred HOMPS variant is one having at least 95% amino acid sequence 
identity to the amino acid sequence of Figure 2, Figure 4, Figure 6, Figure 9 or 
Figure 11. 
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It will be appreciated by those skilled in the art that as a result of the 
degeneracy of the genetic code, a multitude of nucleotide sequences encoding 
HOMPS, some bearing minimal homology to the nucleotide sequences of any 
known and naturally occurring gene, may be produced.. Thus, the invention 
5 contemplates each and every possible variation of nucleotide sequence that could 
be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as 
applied to the nucleotide sequence of naturally occurring HOMPS, and all such 
variations are to be considered as being specifically disclosed. 

10 Although nucleotide sequences which encode HOMPS and its variants 

are preferably capable of hybridizing to the nucleotide sequence of the naturally 
occurring HOMPS under appropriately selected conditions of stringency, it .may 
be advantageous to produce nucleotide sequences encoding HOMPS or its 
derivatives possessing a substantially different codon usage; Codons may be 

15 selected to increase the rate at which expression of the peptide occurs in a 
particular prokaryotic or eukaryotic host in accordance with the frequency with 
which particular codons are utilized by the host. Other reasons for substantially 
altering the nucleotide sequence encoding HOMPS and its derivatives without 
altering the encoded amino acid sequences include the production of RNA 

20 transcripts having more desirable properties, such as a greater half-life, than 
transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences, or 
portions thereof, which encode HOMPS and its derivatives, entirely by synthetic 
chemistry. After production, the synthetic sequence may be inserted into any of 

25 the many available expression vectors and cell systems using reagents that are 
well known in the art at the time of the filing of this application. Moreover, 
synthetic chemistry may be used to introduce mutations into a sequence encoding 
HOMPS or any portion thereof. 

Also encompassed by the invention are polynucleotide sequences that are 

30 capable of hybridizing to the claimed nucleotide sequences, and in particular, 
those shown in Figure 1, Figure 3, Figure 5, Figure 8 or Figure 10, under various 
conditions of stringency. Hybridization conditions are based on the melting 
temperature (Tm) of the nucleic acid binding complex or probe, as taught in 
Wahl, G. M. and S. L. Berger (1987; Methods Enzymol. 152:399-407) and 
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Kimmel, A. R. (1987; Methods Enzymol. 152:507-511), and may be used at a 
defined stringency. 

Altered nucleic acid sequences encoding HOMPS which are 
encompassed by the invention include deletions, insertions, or substitutions of 
5 different nucleotides resulting in a polynucleotide that encodes the same or a 
functionally equivalent HOMPS. The encoded protein may also contain 
. deletions, insertions, or substitutions of amino acid residues which produce a 
silent change and result in a functionally equivalent HOMPS. Deliberate amino 
acid substitutions may be made on the basis of similarity, in polarity, charge, 

10 solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the 
residues as long as the biological activity of HOMPS is retained. For example, 
negatively charged amino acids may include aspartic acid and glutamic acid; 
positively charged amino acids may include lysine and arginine; and amino acids 
with uncharged polar head groups having similar hydrophilicity values may 

1 5 include leucine, isoleucine, and valine; glycine and alanine; asparagine and 
glutamine; serine and threonine; phenylalanine and tyrosine. 

As discussed above, the invention is directed to Her-2/neu 
overexpression modulated proteins (HOMPS). The typical embodiments of the 
invention discussed herein (e.g. polynucleotides, polypeptides, antibodies and 

20 assays for HOMPS gene products etc.) are directed to all of the HOMPS genes 
and gene products (i.e. H13, H14, H17, H41, H63 and C40). In descriptions of 
the invention provided herein, embodiments of a single HOMPS gene are used 
(fot example the H41 gene) to illustrate typical embodiments of the invention 
that apply to all of the HOMPS molecules provided herein. In this context, 

25 ardsans understand that the use of a single HOMPS molecule in illustrative 
typical embodiments common to all of the HOMPS molecules eliminates 
unnecessary redundancy in the description of the invention. 

One aspect of the invention provides polynucleotides corresponding or 
complementary to all or part of a HOMPS gene, mllNA, and/or coding 

30 sequence, preferably in isolated form, including polynucleotides encoding a 
HOMPS protein and fragments thereof, DNA, RNA, DNA/RNA hybrid, and 
related molecules, polynucleotides or oligonucleotides complementary to a 
HOMPS gene or mRNA sequence or a part thereof, and polynucleotides or 
oligonucleotides that hybridize to a HOMPS gene, mRNA, or to a HOMPS 
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encoding polynucleotide (collectively, "HOMPS polynucleotides"). As used 
herein, the HOMPS gene and protein is meant to include the HOMPS genes and 
proteins specifically described herein. 

One illustrative embodiment of a typical HOMPS polynucleotide is a 
5 polynucleotide having the H41 sequence shown in Figure 5 (SEQ ID NO: 5). A 
H41 polynucleotide may comprise a polynucleotide having the nucleotide 
sequence of human H41 as shown in FIG. 5 (SEQ ID NO: 5), wherein T can 
also be U; a polynucleotide that encodes all or part of the H41 protein; a 
sequence complementary to the foregoing; or a polynucleotide fragment of any 
10 of the foregoing. Another embodiment comprises a polynucleotide that is 
capable of hybridizing under stringent hybridization conditions to the human 
H41 cDNA shown in FIG. 5 (SEQ ID NO: 5) or to a polynucleotide fragment 
thereof. 

Typical embodiments of the invention disclosed herein include H41 

15 polynucleotides containing specific portions of the H41 mRNA sequence (and 
those which are complementary to such sequences) such as those that encode the 
protein and fragments thereof. For example, representative embodiments of the 
invention disclosed herein include: polynucleotides encoding about amino acid 1 
to about amino acid 10 of the H41 protein shown in FIG. 5 (SEQ ID NO: 5), 

20 polynucleotides encoding about amino acid 20 to about amino acid 30 of the 
H41 protein shown in FIG. 5 (SEQ ID NO: 5), polynucleotides encoding about 
amino acid 30 to about amino acid 40 of the H41 protein shown in FIG. 5 (SEQ 
ID NO: 5), polynucleotides encoding about amino acid 40 to about amino acid 
50 of the H41 protein shown in FIG. 5 (SEQ ID NO: 5), polynucleotides 

25 encoding about amino acid 50 to about amino acid 60 of the H41 protein shown 
in FIG. 5 (SEQ ID NO: 5), polynucleotides encoding about amino acid 60 to 
about amino acid 70 of the H41 protein shown in FIG. 5 (SEQ ID NO: 5), 
polynucleotides encoding about amino acid 70 to about amino acid 80 of the 
H41 protein shown in FIG. 5 (SEQ ID NO: '5), polynucleotides encoding about 

30 amino acid 80 to about amino acid 90 of the H41 protein shown in FIG. 5 (SEQ 
ID NO: 5) and polynucleotides encoding about amino acid 90 to about amino 
acid 100 of the H41 protein shown in FIG. 5 (SEQ ID NO: 5), etc. Following 
this scheme, polynucleotides encoding portions of the amino acid sequence of 
amino acids 100-258 of the H41 protein are typical embodiments of the 
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invention. Polynucleotides encoding larger portions of the H41 protein are also 
contemplated. For example polynucleotides encoding from about amino acid 1 
(or 20 or 30 or 40 etc.) to about amino acid 20, (or 30, or 40 or 50 etc.) of the 
H41 protein shown in FIG. 5 (SEQ ID NO: 5) may be generated by a variety of 
5 techniques well known in the art. 

Additional illustrative embodiments of H41 polynucleotides include 
embodiments consisting of a polynucleotide having the sequence as shown in 
FIG. 5 (SEQ ID NO: 5) from about nucleotide residue number 1 through about 
nucleotide residue number 500, from about nucleotide residue number 500' 

10 through about nucleotide residue number 1000 and from about nucleotide 
residue number 500 through about nucleotide residue number 1000 and from 
about nucleotide residue number 1000 through about nucleotide residue number 
1500 and from about nucleotide residue number 1500 through about nucleotide 
residue number 2000 and from about nucleotide residue number 2000 through 

15 about nucleotide residue number 2500 and from about nucleotide residue 
number 2500 through about nucleotide residue number 3000 and from about 
nucleotide residue number 3000 through about nucleotide residue number 3346. 
These polynucleotide fragments can be of any size an include any portion of the 
H41 sequence as shown in FIG. 5 (SEQ ID NO: 5), for example a 

20 polynucleotide having the sequence as shown in FIG. 5 (SEQ ID NO: 5) from 
about nucleotide residue number 324 through about nucleotide residue number 
2248. 

Another aspect of the present invention provides H41 proteins and 
polypeptide fragments thereof. The H41 proteins of the invention include those 

25 specifically identified herein, as well as allelic variants, conservative substitution 
variants and homologs that can be isolated/generated and characterized without 
undue experimentation following the methods outlined below. Fusion proteins that 
combine parts of different H41 proteins or fragments thereof, as well as fusion 
proteins of a H41 protein and a heterologous polypeptide are also included. Such 

30 H41 proteins will be collectively referred to as the H41 proteins, the proteins of the 
invention, or H41. As used herein, the term "H41 polypeptide" refers to a 
polypeptide fragment or a H41 protein of at least 6 amino acids, preferably at least 
15 amino acids. 
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Specific embodiments of H41 proteins comprise a polypeptide having the 
amino acid sequence of human H41 as shown in FIG. 6 (SEQ ID NO: 6). 
Alternatively, embodiments of H41 proteins comprise variant polypeptides having 
alterations in the amino acid sequence of human H41 as shown in FIG. 6 (SEQ ID 
5 NO: 6). 

In general, naturally occurring allelic variants of human HOMPS such as 
H41 will share a high degree of structural identity and homology (e.g., 90% or more 
identity). Typically, allelic variants of the HOMPS proteins will contain conservative 
amino acid substitutions within the HOMPS sequences described herein or will 

10 contain a substitution of an amino acid from a corresponding position in a HOMPS 
homologue. One class of HOMPS allelic variants will be proteins that share a high 
degree of homology with at least a small region of a particular HOMPS amino acid 
sequence, but will further contain a radical departure from the sequence, such as a 
non-conservative substitution, truncation, insertion or frame shift 

15 Conservative amino acid substitutions can frequendy be made in a 

protein without altering either the conformation or the function of the protein. 
Such changes include substituting any of isoleucine (I), valine (V), and leucine (L) 
for any other of these hydrophobic amino acids; aspartic acid (D) for glutamic 
acid (E) and vice versa; glutamine (Q) for asparagine (N) and vice versa; and 

20 serine (S) for threonine (T) and vice versa. Other substitutions can also be 
considered conservative, depending on the environment of the particular amino 
acid and its role in the three-dimensional structure of the protein. For example, 
glycine (G) and alanine (A) can frequendy be interchangeable, as can alanine (A) 
and valine (V). Methionine (M), which is relatively hydrophobic, can frequendy 

25 be interchanged with leucine and isoleucine, and sometimes with valine. Lysine 
(K) and arginine (R) are frequendy interchangeable in locations in which the 
significant feature of the amino acid residue is its charge and the differing pK's of 
these two amino acid residues are not significant. Still other changes can be 
considered "conservative" in particular environments. 

30 Embodiments of the invention disclosed herein include a wide variety of 

art accepted variants of HOMPS proteins such as polypeptides having amino acid 
insertions, deletions and substitutions. HOMPS variants can be made using 
methods known in the art such as site-directed mutagenesis, alanine scanning, 
and PCR mutagenesis. Site-directed mutagenesis (Carter et al., 1986, Nucl. Acids 
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Res. 13:4331; Zoller et al, 1987, Nucl. Acids Res. 10:6487), cassette mutagenesis 
(Wells et al., 1985, Gene 34:315), restriction selection mutagenesis (Wells et aL, 
1986, Philos. Trans. R. Soc. London Ser. A, 317:415) or other known techniques 
can be performed on the cloned DNA to produce the HOMPS variant DNA. 
5 Scanning amino acid analysis can also be employed to identify one or more 
amino acids along a contiguous sequence. Among the preferred scanning amino 
acids are relatively small, neutral amino acids. Such amino acids include alanine, 
glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid 
among this group because it eliminates the side-chain beyond the beta-carbon 

10 and is less likely to alter the main-chain conformation of the variant. Alanine is 
also typically preferred because it is the most common amino acid. Further, it is 
frequendy found in both buried and exposed positions (Creighton, The Proteins, 
(W.H. Freeman & Co., N.Y.); Chothia, 1976, J. Mol. Biol, 150:1). If alanine 
substitution does not yield adequate amounts of variant, an isosteric amino acid 

15 can be used. 

As defined herein, HOMPS variants have the distmguishing attribute of 
having at least one epitope in common with a HOMPS protein (such as the H41 
protein having the amino acid sequence of FIG. 6 (SEQ ID NO: 6)), such that an 
antibody that specifically binds to a HOMPS variant will also specifically bind to 

20 the HOMPS protein (such as the HOMPS protein having the amino acid 
sequence of FIG. 6 (SEQ ID NO: 6)). Using H41 as an illustrative example, a 
polypeptide ceases to be a variant of the H41 protein shown in FIG. 6 (SEQ ID 
NO: 6) when it no longer contains an epitope capable of being recognized by an 
antibody that specifically binds to a H41 HOMPS protein. Those skilled in the 

25 art understand that antibodies that recognize proteins bind to epitopes of varying 
size, and a grouping of the order of about six amino acids, contiguous or not, is 
regarded as a typical number of amino acids in a niinimal epitope. See e.g. 
Hebbes et al., Mol Immunol (1989) 26(9):865-73; Schwartz et al., J Immunol 
(1985) 135(4):2598-608. As there are approximately 20 amino acids that can be 

30 included at a given position within the niinimal 6 amino acid epitope, the odds of 
such an epitope occurring by chance are about 20 6 or about 1 in 64 million. 
Another specific class of HOMPS protein variants shares 90% or more identity 
with the amino acid sequence of FIG. 6 (SEQ ID NO: 6). 
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As discussed above, embodiments of the claimed invention include, 
polypeptides containing less than the 258 amino acid sequence of the H41 
piotein shown in FIG. 6 (SEQ ID NO: 6) (and the polynucleotides encoding 
such polypeptides). For example, representative embodiments of the invention 
5 disclosed herein include polypeptides consisting of about amino acid 1 to about 
amino acid 10 of the H41 protein shown in FIG, 6 . (SEQ ID NO: 6), 
polypeptides consisting of about amino acid 20 to about amino acid 30 of the 
H41 protein shown in FIG. 6 (SEQ ID NO: 6), polypeptides consisting of about 
amino acid 30 to about amino acid 40 of the H41 protein shown in FIG. 6 (SEQ 

10 ID NO: 6), polypeptides consisting of about amino acid 40 to about amino acid 
50 of the H41 protein shown in FIG. 6 (SEQ ID NO: 6), polypeptides consisting 
of about amino acid 50 to about amino acid 60 of the H41 protein shown in 
FIG. 6 (SEQ ID NO: 6), polypeptides consisting of about amino acid 60 to 
about amino acid 70 of the H41 protein shown in FIG. 6 (SEQ ID NO: 6), 

15 polypeptides consisting of about amino acid 70 to about arnino acid 80 of the 
H41 protein shown in FIG. 6 (SEQ ID NO: 6), polypeptides consisting of about 
amino acid 80 to about amino acid 90 of the H41 protein shown in FIG. 6 (SEQ 
ID NO: 6) and polypeptides consisting of about amino acid 90 to about amino 
acid 100 of the H41 protein shown in FIG. 6 (SEQ ID NO: 6), etc. Following 

20 this scheme, polypeptides consisting of portions of the amino acid sequence of 
amino acids 100-258 of the H41 protein are typical embodiments of the 
invention. Polypeptides consisting of larger portions of the H41 protein are also 
contemplated. For example polypeptides consisting of about arnino acid 1 (or 20 
or 30 or 40 etc.) to about amino acid 20, (or 30, or 40 or 50 etc.) of the H41 

25 protein shown in FIG. 6 (SEQ ID NO: 6) may be generated by a variety of 
techniques well known in the art. 

Also included within the scope of the present invention are alleles of the 
genes encoding HOMPS. As used herein, an "allele" or "allelic sequence" is an 
alternative form of the gene which may result from at least one mutation in the 

30 nucleic acid sequence. Alleles may result in altered mRNAs or polypeptides 
whose structure or function may or may not be altered. Any given gene may have 
none, one, or many allelic forms. Common mutational changes which give rise to 
alleles are generally ascribed to natural deletions, additions, or substitutions of 
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nucleotides. Each of these types of changes may occur alone, or in combination 
with the others, one or more times in a given sequence. 

Methods for DNA sequencing which are weU known and generally available in 
the art may be used to practice any embodiments of the invention. The methods 
5 may employ such enzymes as the Klenow fragment of DNA polymerase I, 
SEQUENASE (US Biochemical Corp, Cleveland, Ohio), Taq polymerase (Perkin 
Elmer), thermostable T7 polymerase Amersham Pharmacia Biotech (Pisctaway 
N. J. ), or combinations of recombinant polymerases and proofreading 
exonucleases such as the ELONGASE Amplification System marketed by Life 

10 Technologies (Gaithersburg, Md. ). 

Preferably, the process is automated with machines such as the Hamilton Micro 
Lab 2200 (Hamilton, Reno, Nev. ), Peltier Thermal Cycler (PTC200; MJ 
Research, Watertown, Mass. ) and the ABI 377 DNA sequencers (Perkin Elmer). 
The nucleic acid sequences encoding HOMPS may be extended utilizing 

15 a partial nucleotide sequence and employing various methods known in the art to 
detect upstream sequences such as promoters and regulatory elements. For 
example, one method which may be employed, "restriction -site" PCR, uses 
universal primers to retrieve unknown sequence adjacent to a known locus 
(Sarkar, G. (1993) PCR Methods Applic. 2:318-322). In particular, genomic DNA 

20 is first amplified in the presence of primer to linker sequence and a primer 
specific to the known region. The amplified sequences are then subjected to a 
second round of PCR with the same linker primer and another specific primer 
internal to the first one. Products of each round of PCR are transcribed with an 
appropriate RNA polymerase and sequenced using reverse transcriptase. 

25 Inverse PCR may also be used to amplify or extend sequences using 

divergent primers based on a known region (Triglia, T. et al. (1988) Nucleic Acids 
Res. 16:8186). The primers may be designed using OLIGO 4. 06 Primer Analysis 
software (National Biosciences Inc. , Plymouth, Minn. ), or another appropriate 
program, to be 22-30 nucleotides in length, to have a GC content of 50% or 

30 more, and to anneal to the target sequence at temperatures about 68°-72° C The 
method uses several restriction enzymes to generate a suitable fragment in the 
known region of a gene. The fragment is then circularized by intramolecular 
ligation and used as a PCR template. 
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Another method which may be used is capture PCR which involves PCR 
amplification of DNA fragments adjacent to a known sequence in human and 
yeast artificial chromosome DNA (Lagerstrom, M. et al. (1991) PCR Methods 
Applic. 1:111-119). In this method, multiple restriction enzyme digestions and 
5 ligations may also be used to place an engineered double-stranded sequence into 
an unknown portion of the DNA molecule before performing PCR. 

Another method which may be used to retrieve unknown sequences is 
that of Parker, J. D. et al. (1991; Nucleic Acids Res. 19:3055-3060). Additionally, 
one may use PCR, nested primers, and PROMOTERFINDER libraries 
10 (Clontech, Palo Alto, Calif. ) to walk genomic DNA. This process avoids the 
need to screen libraries and is useful in finding intron/exon junctions. 

When screening for full-length cDNAs, it is preferable to use libraries 
that have been size-selected to include larger cDNAs. Also, random-primed 
libraries are preferable, in that they will contain more sequences which contain 
15 the 5* regions of genes. Use of a randomly primed library may be especially 
preferable for situations in which an oligo d(T) library does not yield a full-length 
cDNA. 

Genomic libraries may be useful for extension of sequence into the 5* and 3' non- 
transcribed regulatory regions. 

20 Capillary electrophoresis systems which are commercially available may 

be used to analyze the size or confirm the nucleotide sequence of sequencing or 
PCR products. In particular, capillary sequencing may employ flowable polymers 
for electrophoretic separation, four different fluorescent dyes (one for each 
nucleotide) which are laser activated, and detection of the emitted wavelengths by 

25 a charge coupled device camera. Output/light intensity may be converted to 
electrical signal using appropriate software (e. g. GENOTYPER and 
SEQUENCE NAVIGATOR, Perkin Elmer) and the entire process from loading 
of samples to computer analysis and electronic data display may be computer 
controlled. 

30 Capillary electrophoresis is especially preferable for the sequencing of 

small pieces of DNA which might be present in limited amounts in a particular 
sample. 

In another embodiment of the invention, polynucleotide sequences or 
fragments thereof which encode HOMPS, or fusion proteins or functional 
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equivalents thereof, may be used in recombinant DNA molecules to direct 
expression of HOMPS in appropriate host cells. Due to the inherent degeneracy 
of the genetic code, other DNA sequences which encode substantially the same 
or a functionally equivalent amino acid sequence may be produced and these 
5 sequences may be used to clone and express HOMPS. 

As will be understood by those of skill in the art, it may be advantageous 
to produce HOMPS-encoding nucleotide sequences possessing non-naturally 
occurring codons. For example, codons preferred by a particular prokaryotic or 
eukaryoric host can be selected to increase the rate of protein expression or to 

10 produce a recombinant RNA transcript having desirable properties, such as a 
half-life which is longer than that of a transcript generated from the naturally 
occurring sequence. 

The nucleotide sequences of the present invention can be engineered 
using methods generally known in the art in order to alter HOMPS encoding 

15 sequences for a variety of reasons, including but not limited to, alterations which 
modify the cloning, processing, and/or expression of the gene product DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and 
synthetic oligonucleotides may be used to engineer the nucleotide sequences. For 
example, site-directed mutagenesis may be used to insert new restriction sites, 

20 alter glycosylation patterns, change codon preference, produce splice variants, or 
introduce mutations, and so forth. 

In another embodiment of the invention, natural, modified, or 
recombinant nucleic acid sequences encoding HOMPS may be ligated to a 
heterologous sequence to encode a fusion protein. For example, to screen 

25 peptide libraries for inhibitors of HOMPS activity, it may be useful to encode a 
chimeric HOMPS protein that can be recognized by a commercially available 
antibody. A fusion protein may also be engineered to contain a cleavage site 
located between the HOMPS encoding sequence and the heterologous protein 
sequence, so that HOMPS may be cleaved and purified away from the 

30 heterologous moiety. 

In another embodiment, sequences encoding HOMPS may be 
synthesized, in whole or in part, using chemical methods well known in the art 
(see Caruthers, M. H. et al. (1980) Nucl. Acids Res. Symp. Ser. 215-223, Horn, T. 
et al. (1980) Nucl. Acids Res. Symp. Ser. 225-232). Alternatively, the protein itself 
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may be produced using chemical methods to synthesize the amino acid sequence 
of HOMPS, or a portion thereof. For example, peptide synthesis can be 
performed using various solid-phase techniques (Roberge, J. Y. et al. (1995) 
Science 269:202-204) and automated synthesis may be achieved, for example, 
5 using the ABI 431 A Peptide Synthesizer (Perkin Elmer). 

The newly synthesized peptide may be substantially- purified by 
preparadve high performance liquid chromatography (e. g. , Creighton, T. (1983) 
Proteins, Structures and Molecular Principles, WH Freeman and Co. , New York, 
N. Y. ). The composition of the synthetic peptides may be confirmed by amino 
10 acid analysis or sequencing (e. g. , the Edman degradation procedure; Creighton, 
supra). 

Additionally, the amino acid sequence of HOMPS, or any part thereof, may be 
altered during direct synthesis and/or combined using chemical methods with 
sequences from other proteins, or any part thereof, to produce a variant 
15 polypeptide. 

In order to express a biologically active HOMPS, the nucleotide 
sequences encoding HOMPS or functional equivalents, may be inserted into 
appropriate expression vector, i. e. , a vector which contains the necessary 
elements for the transcription and translation of the inserted coding sequence. 

20 Methods which are well known to those skilled in the art may be used to 

construct expression vectors containing sequences encoding HOMPS and 
appropriate transcriptional and translational control elements. These methods 
include in vitro recombinant DNA techniques, synthetic techniques, and in vivo 
genetic recombination. Such techniques are described in Sambrook, J. et al. 

25 (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, 
Plainview, N. Y. , and Ausubel, F. M. et al (1989) Current Protocols in Molecular 
Biology, John Wiley & Sons, New York. N. Y. 

A variety of expression vector/host systems may be utilized to contain and 
express sequences encoding HOMPS. These include, but are not limited to, 
30 microorganisms such as bacteria transformed with recombinant bacteriophage, 
plasmid, or cosmid DNA expression vectors; yeast transformed with yeast 
expression vectors; insect cell systems infected with virus expression vectors (e. g. 
, baculovirus); plant cell systems transformed with virus expression vectors (e. g. , 
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cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or with bacterial 
expression vectors (e. g. , Ti or pBR322 plasmids); or animal cell systems. 

The "control elements" or "regulatory sequences" are those non- 
translated regions of the vector-enhancers, promoters, 5 1 and 3' untranslated 
5 regions-which interact with host cellular proteins to carry out transcription and 
translation. Such elements may vary in their strength and specificity. 

Depending on the vector system and host utilized, any number of suitable 
transcription and translation elements, including constitutive and inducible 
promoters, may be used. For example, when cloning in bacterial systems, 
10 inducible promoters such as the hybrid lacZ promoter of the PBLUESCRIPT 
phagemid (Stratagene, Lajolla, Calif. ) or PSPORT1 plasmid (Gibco BRL) and 
the like may be used. The baculovirus polyhedrin promoter may be used in insect 
cells. Promoters or enhancers derived from the genomes of plant cells (e. g. , heat 
shock, RUBISCO; and storage protein genes) or from plant viruses (e. g. , viral 
15 promoters or leader sequences) may be cloned into the vector. In mammalian cell 
systems, promoters from mammalian genes or from mammalian viruses are 
preferable. If it is necessary to generate a cell line that contains multiple copies of 
the sequence encoding HOMPS, vectors based on SV40 or EBV may be used 
with an appropriate selectable marker. 
20 In bacterial systems, a number of expression vectors may be selected 

depending upon the use intended for HOMPS. For example, when large 
quantities of HOMPS are needed for the induction of antibodies, vectors which 
direct high level expression of fusion proteins that are readily purified may be 
used. Such vectors include, but are not limited to, the multifunctional E. coli 
25 cloning and expression vectors such as BLUESCRIPT (Stratagene), in which the 
sequence encoding HOMPS may be ligated into the vector in frame with 
sequences for the ammo-terminal Met and the subsequent 7 residues of P- 
galactosidase so that a hybrid protein is produced; pIN vectors (Van Heeke, G. 
and S. M. Schuster (1989) J. Biol. Chem. 264:5503-5509); and the like. pGEX 
30 vectors (Promega, Madison, Wis. ) may also be used to express foreign 
polypeptides' as fusion proteins with glutathione S-transferase (GST). In general, 
such fusion proteins are soluble and can easily be purified from lysed cells by 
adsorption to glutathione-agarose beads followed by elution in the presence of 
free glutathione. Proteins made in such systems may be designed to include 
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heparin, thrombin, or factor XA protease cleavage sites so that the cloned 
polypeptide of interest can be released from the GST moiety at will. 

In the yeast, Saccharomyces cerevisiae, a number of vectors containing 
constitutive or inducible promoters such as alpha factor, alcohol oxidase, and 
5 PGH may be used. For reviews, see Ausubel et al. (supra) and Grant et al. (1987) 
Methods Enzymol. 153:516-544. 

In cases where plant expression vectors are used, the expression of 
sequences encoding HOMPS may be driven by any of a number of promoters. 
For example, viral promoters such as the 35S and 19S promoters of CaMV may 

10 be used alone or in combination with the omega leader sequence from TMV 
(Takamatsu, N. (1987) EMBO J. 6:307-311. Alternatively, plant promoters such 
as the small subunit of RUBISCO or heat shock promoters may be used 
(Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) Science 
224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105). 

15 These constructs can be introduced into plant cells by direct DNA 
transformation or pathogen-mediated transfection. Such techniques are described 
in a number of generally available reviews (see, for example, Hobbs, S. or Murry, 
L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, 
New York, N. Y. ; pp. 191-196). 

20 An insect system may also be used to express HOMPS. For example, in 

one such system, Autographa californica nuclear polyhedrosis virus (AcNPV) is 
used as a vector to express foreign genes in Spodoptera frugiperda cells or in 
Trichoplusia larvae. The sequences encoding HOMPS may be cloned into a non- 
essential region of the virus, such as the polyhedrin gene, and placed under 

25 control of the polyhedrin promoter. Successful insertion of HOMPS will render 
the polyhedrin gene inactive and produce recombinant virus lacking coat protein. 
The recombinant viruses may then be used to infect, for example, S. frugiperda 
cells or Trichoplusia larvae in which HOMPS may be expressed (Engelhard, E. 
K. et al (1 994) Proc. Nat. Acad. Sci. 91 :3224-3227). 

30 In mammalian host cells, a number of viral-based expression systems may 

be utilized. In cases where an adenovirus is used as an expression vector, 
sequences encoding HOMPS may be ligated into an adenovirus 
transcription/ translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a non-essential El or E3 region of the viral genome 
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may be used to obtain a viable virus which is capable of expressing HOMPS in 
infected host cells (Logan, J. and Shenk, T. (1984) Proc. Natl. Acad. Sci. 81:3655- 
3659). In addition, transcription enhancers, such as the Rous sarcoma virus 
(RSV) enhancer, may be used to increase expression in mammalian host cells. 
5 Specific initiation signals may also be used to achieve more efficient 

translation of sequences encoding HOMPS. Such signals include the ATG 
initiation codon and adjacent sequences. In cases where sequences encoding 
HOMPS, its initiation codon, and upstream sequences are inserted into the 
appropriate expression vector, no additional transcriptional or translational 

10 control signals may be needed. However, in cases where only coding sequence, or 
a portion thereof, is inserted, exogenous translational control signals including 
the ATG initiation codon should be provided. Furthermore, the initiation codon 
should be in the correct reading frame to ensure translation of the entire insert. 
Exogenous translational elements and initiation codons may be of various 

15 origins, both natural and synthetic. The efficiency of expression may be enhanced 
by the inclusion of enhancers which are appropriate for the particular cell system 
which is used, such as those described in the literature (Scharf, D. et al. (1994) 
Results Probi. Cell Differ. 20:125-162). 

In addition, a host cell strain may be chosen for its ability to modulate the 

20 expression of the inserted sequences or to process the expressed protein in the 
desired fashion. Such modifications of the polypeptide include, but are not 
limited to, acetylation, carboxylation. glycosylation, phosphorylation, lipidation, 
and acylation. Post-translational processing which cleaves a "prepro" form of the 
protein may also be used to facilitate correct insertion, folding and/or function. 

25 Different host cells such as CHO, HeLa, MDCK, HEK293, and WI38, which 
have specific cellular machinery and characteristic mechanisms for such post- 
translational activities, may be chosen to ensure the correct modification and 
processing of the foreign protein. 

For long-term, high-yield production of recombinant proteins, stable 

30 expression is preferred. For example, cell lines which stably express HOMPS may 
be transformed using expression vectors which may contain viral origins of 
replication and/or endogenous expression elements and a selectable marker gene 
on the same or on a separate vector. Following the introduction of the vector, 
cells may be allowed to grow for 1-2 days in an enriched media before they are 
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switched to selective media. The purpose of the selectable marker is to confer 
resistance to selection, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably 
transformed cells may be proliferated using tissue culture techniques appropriate 
5 to the cell type. 

Any number of selection systems may be used to recover transformed 
cell lines. These include, but are not limited to, the herpes simplex virus 
thymidine kinase (Wigler, M. et al. (1977) Cell 11:223-32) and adenine 
phosphoribosyltransferase (Lowy, I. et al. (1990) Cell 22:817-23) genes which can' 

10 be employed in tk" or aprt' cells, respectively. Also, antimetabolite, antibiotic or 
herbicide resistance can be used as the basis for selection; for example, dhfr 
which confers resistance to methotrexate (Wigler, M. et al. (1980) Proc. Natl. 
Acad. Sci. 77:3567-70); npt, which confers resistance to the aminoglycosides, 
neomycin and G-418 (Colbere-Garapin, F. et al (1981) J. Mol. Biol. 150:1-14); 

15 and als or pat, which confer resistance to chlorsulfuron and phosphinotricin 
acetyltransferase, respectively (Murry, supra). Additional selectable genes have 
been described, for example, trpB, which allows cells to utili2e indole in place of 
tryptophan, or hisD, which allows cells to utilize histinol in place of histidine 
(Hartman, S. C. and R. C Mulligan (1988) Proc. Nad. Acad. Sci. 85:8047-51). 

20 Recendy, the use of visible markers has gained popularity with such markers as 
anthocyanins, P glucuronidase and its substrate GUS, and luciferase and its 
substrate luciferin, being widely used not only to identify trans formants, but also 
to quantify the amount of transient or stable protein expression attributable to a 
specific vector system (Rhodes, C. A. et al. (1995) Methods Mol. Biol. 55:121- 

25 131). 

Although the presence/absence of marker gene expression suggests that 
the gene of interest is also present, its presence and expression may need to be 
confirmed. For example, if the sequence encoding HOMPS is inserted within a 
marker gene sequence, recombinant cells containing sequences encoding 
30 HOMPS can be identified by the absence of marker gene function. Alternatively, 
a marker gene can be placed in tandem with a sequence encoding HOMPS under 
the control of a single promoter. Expression of the marker gene in response to 
induction or selection usually indicates expression of the tandem gene as well. 
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Alternatively, host cells which contain the nucleic acid sequence encoding 
HOMPS and express HOMPS may be identified by a variety of procedures 
known to those of skill in the art. These procedures include, but are not limited 
to, DNA-DNA or DNA-RNA hybridizations and protein bioassay or 
5 immunoassay techniques which include membrane, solution, or chip based 
technologies for the detection and/or quantification of nucleic acid or protein. 

The presence of polynucleotide sequences encoding HOMPS can be 
detected by DNA-DNA or DNA-RNA hybridization or amplification using 
probes or portions or fragments of polynucleotides encoding HOMPS. Nucleic 

10 acid amplification based assays involve the use of oligonucleotides or oligomers 
based on the sequences encoding HOMPS to detect transformants containing 
DNA or RNA encoding HOMPS. As used herein "oligonucleotides" or. 
"oligomers" refer to a nucleic acid sequence of at least about 10 nucleotides and 
as many as about 60 nucleotides, preferably about 15 to 30 nucleotides, and more 

15 preferably about 20-25 nucleotides, which can be used as a probe or amplimer. 

A variety of protocols for detecting and measuring the expression of 
HOMPS, using either polyclonal or monoclonal antibodies specific for the 
protein are known in the art Examples include enzyme-linked immunosorbent 
assay (ELISA), radioimmunoassay (RIA), and fluorescence activated cell sorting 

20 (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on HOMPS is preferred, but 
a competitive binding assay may be employed. These and other assays are 
described, among other places, in Hampton, R. et aL (1990; Serological Methods, 
a Laboratory Manual, APS Press, St Paul. Minn. ) and Maddox, D. E. et aL (1983; 

25 J. Exp. Med. 158:1211-1216). 

A wide variety of labels and conjugation techniques are known by those 
skilled in the art and may be used in various nucleic acid and amino acid assays. 
Means for producing labeled hybridization or PGR probes for detecting 
sequences related to polynucleotides encoding HOMPS include oligolabeling, 

30 nick translation, end-labeling or PCR amplification using a labeled nucleotide. 

Alternatively, the sequences encoding HOMPS, or any portions thereof 
may be cloned into a vector for the production of an mRNA probe. Such vectors 
are known in the art, are commercially available, and may be used, to synthesize 
RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, 



43 



WO 01/25250 



PCT/US00/27649 



T3, or SP6 and labeled nucleotides. These procedures may be conducted using a 
variety of commercially available kits Amersham Pharmacia Biotech, Promega, 
and US Biochemical Corp. Suitable reporter molecules or labels, which may be 
used, include radionuclides, enzymes, fluorescent, chemiluminescent, or 
5 chromogenic agents as well as substrates, cofactors, inhibitors, magnetic particles, 
and the like. 

Host cells transformed with nucleotide sequences encoding HOMPS may 
be cultured under conditions suitable for the expression and recovery of the 
protein from cell culture. The protein produced by a recombinant cell may be 

10 secreted or contained intracellularly depending on the sequence and/or the 
vector used. As will be understood by those of skill in the art, expression vectors 
containing polynucleotides which encode HOMPS may be designed to contain 
signal sequences which direct secretion of HOMPS through a prokaryotic or 
eukaryotic cell membrane. Other recombinant constructions may be used to join 

15 sequences encoding HOMPS to nucleotide sequence encoding a polypeptide 
domain which will facilitate purification of soluble proteins. Such purification 
facilitating domains include, but are not limited to, metal chelating peptides such 
as his tidine- tryptophan modules that allow purification on immobilized metals, 
protein A domains that allow purification on immobilized immunoglobulin, and 

20 the domain utilized in the FLAGS extension /affinity purification system 
(Immunex Corp. , Seattle, Wash. ). The inclusion of cleavable linker sequences 
such as those specific for Factor XA or enterokinase (Invitrogen. San Diego, 
Calif. ) between the purification domain and HOMPS may be used to facilitate 
purification. 

25 One such expression vector provides for expression of a fusion protein 

containing HOMPS and a nucleic acid encoding 6 his tidine residues preceding a 
thioredoxin or an enterokinase cleavage site. The his tidine residues facilitate 
purification on IMIAC (immobilized metal ion affinity chromatography) as 
described in Porath, J. et al. (1992, Prot Exp. Purif. 3: 263-281) while the 

30 enterokinase cleavage site provides a means for purifying HOMPS from the 
fusion protein. A discussion of vectors which contain fusion proteins is provided 
in Kroll, D. j. et aL (1993; DNA Cell Biol. 12:441-453). 

In addition to recombinant production fragments of HOMPS may be 
produced by direct peptide synthesis using solid-phase techniques (Merrifield J. 
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(1963) J. Am. Chem. Soc. 85:2149-2154), Protein synthesis may be performed 
using manual techniques or by automation. Automated synthesis may be 
achieved, for example, using Applied Biosystems 431A Peptide Synthesizer 
(Perkin Elmer). Various fragments of HOMPS may be chemically synthesized 
5 separately and combined using chemical methods to produce the full length 
molecule. 

Other specifically contemplated embodiments of the invention disclosed 
herein are genomic DNA, cDNAs, ribozymes, and antisense molecules, as well as 
nucleic acid molecules based on an alternative backbone or including alternative 

10 bases, whether derived from natural sources or synthesized. For example, 
antisense molecules can be RNAs or other molecules, including peptide nucleic 
acids (PNAs) or non-nucleic acid molecules such as phosphorothioate 
derivatives, that specifically bind DNA or RNA in a base pair-dependent manner. 
A skilled artisan can readily obtain these classes of nucleic acid molecules using the 

15 HOMPS polynucleotides and polynucleotide sequences disclosed herein. 

Antisense technology entails the administration of exogenous 
oligonucleotides that bind to a target polynucleotide located within the cells. The 
term "antisense" refers to the fact that such oligonucleotides ate complementary 
to their intracellular targets, e.g., HOMPS. See for example, Jack Cohen, 1988, 

20 OLIGODEOXYNUCLEOTIDES, Antisense Inhibitors of Gene Expression, 
CRC Press; and Synthesis 1:1-5 (1988). The HOMPS antisense oligonucleotides 
of the present invention include derivatives such as S-oligonucleotides 
(phosphorothioate derivatives or S-oligos, see, Jack Cohen, supra), which exhibit 
enhanced cancer cell growth inhibitory action. S-oligos (nucleoside 

25 phosphorothioates) are isoelectronic analogs of an oligonucleotide (O-oligo) in 
which a nonbridging oxygen atom of the phosphate group is replaced by a sulfur 
atom. The S-oligos of the present invention may be prepared by treatment of the 
corresponding O-oligos with 3H-l,2-benzodithiol-3 -one- 1,1 -dioxide, which is a 
sulfur transfer reagent. See Iyer, R. P. et al, 1990, J. Org. Chem. 55:4693-4698; 

30 and Iyer, R. P. et al., 1990, J. Am. Chem. Soc. 112:1253-1254, the disclosures of 
which are fully incorporated by reference herein. Additional HOMPS antisense 
oligonucleotides of the present invention include morpholino antisense 
oligonucleotides known in the art (see e.g. Partridge et al., 1996, Antisense & 
Nucleic Acid Drug Development 6: 169-175). 
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The HOMPS antisense oligonucleotides of the present invention typically 
may be RNA or DNA that is complementary to and stably hybridizes with the 
first 100 N-terminal codons or last 100 C-terminal codons of the HOMPS 
genomic sequence or the corresponding rnRNA. While absolute 
5 complementarity is not required, high degrees of complementarity are preferred. 
Use of an oligonucleotide complementary to this region allows for the selective 
hybridization to HOMPS mRNA and not to mRNA specifying other regulatory 
subunits of protein kinase. Preferably, the HOMPS antisense oligonucleotides of 
the present invention are a 15 to 30-mer fragment of the antisense DNA 

10 molecule having a sequence that hybridizes to HOMPS mRNA. Optionally, 
HOMPS antisense oligonucleotide is a 30-mer oligonucleotide that is 
complementary to a region in the first 10 N-terrninal codons and last 10 C- 
terminal codons of HOMPS, Alternatively, the antisense molecules are modified 
to employ ribozymes in the inhibition of HOMPS expression (L. A. Couture & 

15 D. T. Stinchcomb, 1996, Trends Genet 12: 510-515). 

Further specific embodiments of this aspect of the invention include 
primers and primer pairs, which allow the specific amplification of the 
polynucleotides of the invention or of any specific parts thereof, and probes that 
selectively or specifically hybridize to nucleic acid molecules of the invention or 

20 to any part thereof. Probes may be labeled with a detectable marker, such as, for 
. example, a radioisotope, fluorescent compound, bioluminescent compound, a 
chemnuminescent compound, metal chelator or enzyme. Such probes and 
primers can be used to detect the presence of a HOMPS polynucleotide in a sample 
and as a means for detecting a cell expressing a HOMPS protein. 

25 Illustrative examples of such probes include polypeptides comprising all or 

part of the human HOMPS H41 cDNA sequences shown in FIG. 5. Examples of 
primer pairs capable of specifically amplifying HOMPS mRNAs are also described 
in the Examples that follow. As will be understood by the skilled artisan, a great 
many different primers and probes may be prepared based on the sequences 

30 provided herein and used effectively to amplify and/or detect a HOMPS mRNA. 

As used herein, a polynucleotide is said to be "isolated" when it is 
substantially separated from contaminant polynucleotides that correspond or are 
complementary to genes other than the HOMPS gene or that encode polypeptides 
other than HOMPS gene product or fragments thereof. A skilled artisan can readily 
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employ nucleic acid isolation procedures to obtain an isolated HOMPS 
polynucleotide. 

The HOMPS polynucleotides of the invention are useful for a variety of 
purposes, including but not limited to their use as probes and primers for the 
5 amplification and/or detection of the HOMPS gene(s), mRNA(s), or fragments 
thereof; as reagents for the evaluation and/or diagnosis and/or prognosis of 
cancers; as coding sequences capable of directing the expression of HOMPS 
polypeptides; as tools for modulating or inhibiting the expression of the HOMPS 
gene(s) and/or translation of the HOMPS transcripts); and as therapeutic agents. 

10 Expression vectors derived from retroviruses, adenovirus, herpes or 

vaccinia viruses, or from various bacterial plasmids may be used for delivery of 
nucleotide sequences to the targeted organ, tissue or cell population. Methods 
which are well known to those skilled in the art can be used to construct 
recombinant vectors which will express antisense molecules complementary to 

15 the polynucleotides of the gene encoding HOMPS. These techniques are 
described both in Sambrook et al. (supra) and in Ausubel et al. (supra). 

Genes encoding HOMPS can be turned off by transforming a cell or 
tissue with expression vectors which express high levels of a polynucleotide or 
fragment thereof which encodes HOMPS. Such constructs may be used to 

20 introduce untranslatable sense or antisense sequences into a cell. Even in the 
absence of integration into the DNA, such vectors may continue to transcribe 
RNA molecules until they are disabled by endogenous nucleases. Transient 
expression may last for a month or more with a non-replicating vector and even 
longer if appropriate replication elements are part of the vector system. 

25 As mentioned above, modifications of gene expression can be obtained 

by designing antisense molecules, DNA, RNA, or PNA, to the control regions of 
the gene encoding HOMPS, i. e. the promoters, enhancers, and introns. 
Oligonucleotides derived from the transcription initiation site, e. g. , between 
positions -10 and +10 from the start site, are preferred. Similarly, inhibition can 

30 be achieved using "triple helix" base-pairing methodology. Triple helix pairing is 
useful because it causes inhibition of the ability of the double helix to open 
sufficiently for the binding of polymerases, transcription factors, or regulatory 
molecules. Recent therapeutic advances using triplex DNA have been described 
in the literature (Gee, J. E. et al (1994) In: Huber, B. E. and B. I. Carr, Molecular 
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and Immunologic Approaches, Futura Publishing Co. , Mt. Kisco, N. Y. ). The 
antisense molecules may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the 
5 specific cleavage of RNA. The mechanism of ribozyme action involves sequence- 
. specific hybridization of the ribozyme molecule to complementary target RNA; 
followed by endonucleolytic cleavage. Examples which may be used include 
engineered hammerhead motif ribozyme molecules that can specifically and 
efficiently catalyze endonucleolytic cleavage of sequences encoding HOMPS. 

10 Specific ribozyme cleavage sites within any potential RNA target are 

initially identified by scanning the target molecule for ribozyme cleavage sites 
which include the following sequences: GUA, GUU, and GUC. Once identified, 
short RNA sequences of between 15 and 20 ribonucleotides corresponding to 
the region of the target gene containing the cleavage site may be evaluated for 

15 secondary structural features which may render the oligonucleotide inoperable. 
The suitability of candidate targets may also be evaluated by testing accessibility 
to hybridization with complementary oligonucleotides using ribonuclease 
protection assays. 

Antisense molecules and ribozymes of the invention may be prepared by 
20 any method known in the art for the synthesis of nucleic acid molecules. These 
include techniques for chemically synthesizing oligonucleotides such as solid 
phase phosphoramidite chemical synthesis. Alternatively, RNA molecules may be 
generated by in vitro and in vivo transcription of DNA sequences encoding 
HOMPS. 

25 Such DNA sequences may be incorporated into a wide variety of vectors 

with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these 
cDNA constructs that synthesize antisense RNA constitutively or inducibly can 
be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and 

30 half-life. 

Possible modifications include, but are not limited to, the addition of 
flanking sequences at the 5' and/or 3' ends of the molecule or the use of 
phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages within 
the backbone of the molecule. This concept is inherent in the production of 
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PNAs and can be extended in all of these molecules by the inclusion of 
nontraditional bases such as inosine, queosine, and wybutosine, as well as acetyl-, 
methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, 
thymine, and uridine which are hot as easily recognized by endogenous 
5 endonucieases. 

Many methods for introducing vectors into cells or tissues are available 
and equally suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, 
vectors may be introduced into stem cells taken from the patient and clonally 
propagated for autologous transplant back into that same patient. Delivery by 
10 transfection and by liposome injections may be achieved using methods which 
are well known in the art. Where appropriate, the methods described above may 
be applied to any subject in need of such therapy, including, for example, 
mammals such as dogs, cats, cows, horses, rabbits, monkeys, and most 
preferably, humans. 

15 As discussed in detail below, in another embodiment, antibodies which 

specifically bind HOMPS may be used for the evaluation and characterization of 
conditions or diseases characterized by expression of HOMPS, or in assays to 
monitor patients being treated with HOMPS, agonists, antagonists or inhibitors. 
I lie antibodies useful for diagnostic purposes may be prepared in the same 

20 manner as those described above for therapeutics. Diagnostic assays for HOMPS 
include methods which utilize the antibody and a label to detect HOMPS in 
human body fluids or extracts of cells or tissues. The antibodies may be used 
with or without modification, and may be labeled by joining them, either 
covalendy or non-covalendy, with a reporter molecule. A wide variety of reporter 

25 molecules which are known in the art may be used, several of which are 
described above. 

A variety of protocols including ELISA, RIA, and FACS for measuring 
HOMPS are known in the art and provide a basis for diagnosing altered or 
abnormal levels of HOMPS expression. Normal or standard values for HOMPS 
30 expression are established by combining body fluids or cell extracts taken from 
normal mammalian subjects, preferably human, with antibody to HOMPS under 
conditions suitable for complex formation. The amount of standard complex 
formation may be quantified by various methods, but preferably by photometric, 
means. Quantities of HOMPS expressed in subject samples, control and diseases 
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from biopsied tissues are compared with the standard values. Deviation between 
standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding 
HOMPS may be used for diagnostic purposes. The polynucleotides which may 
5 be used include oligonucleotide sequences, antisense RNA and DNA molecules, 
and PNAs. The polynucleotides may be used to detect and quantitate gene 
expression in biopsied tissues in which expression of HOMPS may be correlated 
with disease. The diagnostic assay may be used to distinguish between absence, 
presence, and excess expression of HOMPS, and to monitor regulation of 

10 HOMPS levels during therapeutic intervention. 

In one aspect, hybridization with PGR probes which are capable of 
detecting polynucleotide sequences, including genomic sequences, encoding 
HOMPS or closely related molecules, may be used to identify nucleic acid 
sequences which encode HOMPS. The specificity of the probe, whether it is 

15 made from a highly specific region, e. g. , 10 unique nucleotides in the 5* 
regulatory region, or a less specific region, e. g. , especially in the 3' coding region, 
and the stringency of the hybridization or amplification (maximal, high, 
intermediate, or low) will determine whether the probe identifies only naturally 
occurring sequences encoding HOMPS, alleles, or related sequences. 

20 Probes may also be used for the detection of related sequences, and 

should preferably contain at least 50% of the nucleotides from any of the 
HOMPS encoding sequences. The hybridization probes of the subject invention 
may be DNA or RNA and derived from the nucleotide sequence of Figure 1, 
Figure 3, Figure 5, Figure 8 or Figure 10 or from genomic sequence including 

25 promoter, enhancer elements, and introns of the naturally occurring HOMPS. 

Means for producing specific hybridization probes for DNAs encoding 
HOMPS include the cloning of nucleic acid sequences encoding HOMPS or 
HOMPS derivatives into vectors for the production of mRNA probes. Such 
vectors are known in the art, commercially available, and may be used to 

30 synthesize RNA probes in vitro by means of the addition of the appropriate 
RNA polymerases and the appropriate labeled nucleotides. Hybridization probes 
may be labeled by a variety of reporter groups, for example, radionuclides such as 
32 P or 35 S, or enzymatic labels, such as alkaline phosphatase coupled to the probe 
via avidin/biotin coupling systems, and the like. 
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Polynucleotide sequences encoding HOMPS may be used for the 
evaluation and characterization of disorders associated with the expression of 
HOMPS. Examples of such disorders include: various types of cancer such as 
breast cancer. The polynucleotide sequences encoding HOMPS may be used in 
5 Southern or northern analysis, dot blot, or other membrane-based technologies; 
in PCR technologies; or in dip stick, pin, ELISA or chip assays utilizing fluids or 
tissues from patient biopsies to detect altered HOMPS expression. Such 
qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding HOMPS may 

10 be useful in assays that detect activation or induction of various cancers such as 
breast cancer, particularly those mentioned above. The nucleotide sequences 
encoding HOMPS may be labeled by standard methods, and added to a fluid or 
tissue sample from a patient under conditions suitable for the formation of 
hybridization complexes. After a suitable incubation period, the sample is washed 

15 and the signal is quantitated and compared with a standard value. If the amount 
of signal in the biopsied or extracted sample is significantly altered from that of a 
comparable control sample, the nucleotide sequences have hybridized with 
nucleotide sequences in the sample, and the presence of altered levels of 
nucleotide sequences encoding HOMPS in the sample indicates the presence of 

20 the associated disease. Such assays may also be used to evaluate the efficacy of a 
particular therapeutic treatment regimen in animal studies, in clinical trials, or in 
monitoring the treatment of an individual patient 

In order to provide a basis for the evaluation and characterization of 
disease associated with expression of HOMPS, a normal or standard profile for 

25 expression is established. This may be accomplished by combining body fluids or 
cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, which encodes HOMPS, under conditions 
suitable for hybridization or amplification. Standard hybridization may be 
quantified by comparing the values obtained from normal subjects with those 

30 from an experiment where a known amount of a substantially purified 
polynucleotide is used. Standard values obtained from normal samples may be 
compared with values obtained from samples from patients who are symptomatic 
for disease. Deviation between standard and subject values is used to establish 
the presence of disease. 
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Once disease is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to evaluate whether the 
level of expression in the patient begins to approximate that which is observed in 
the normal patient The results obtained from successive assays may be used to 
5 show the efficacy of treatment over a period ranging from several days to 
months. 

With respect to cancer, the presence of a relatively high amount of 
transcript in biopsied tissue from an individual may indicate a predisposition for 
the development of the disease, or may provide a means for detecting the disease 

10 prior to the appearance of actual clinical symptoms. A more definitive evaluation 
and characterization of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier thereby preventing the 
development or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the 

15 sequences encoding HOMPS may involve the use of PCR. Such oligomers may 
be chemically synthesized, generated enzymatically, or produced from a 
recombinant source. Oligomers will preferably consist of two nucleotide 
sequences, one with sense orientation (S^V) and another with antisense (3 f <— 5'), 
employed under optimized conditions for identification of a specific gene or 

20 condition. The same two oligomers, nested sets of oligomers, or even a 
degenerate pool of oligomers may be employed under less stringent conditions 
for detection and/ or quantitation of closely related DNA or RNA sequences. 

Another aspect of the present invention relates to methods for detecting 
HOMPS polynucleotides and HOMPS proteins and variants thereof, as well as 

25 methods for identifying a cell that expresses HOMPS. The expression profile of 
HOMPS makes it a potential diagnostic marker for local and/ or metastasized 
disease. In this context, the status of HOMPS gene products may provide 
information useful for predicting a variety of factors including susceptibility to 
advanced stage disease, rate of progression, and/or tumor aggressiveness. As 

30 discussed in detail below, the status of HOMPS gene products in patient samples 
may be analyzed by a variety protocols that are well known in the art including 
immuriohistochemical analysis, the variety of Northern blotting techniques 
including in situ hybridization, RT-PCR analysis (for example on laser capture 
micro-dissected samples), western blot analysis and tissue array analysis. 
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More particularly, the invention provides assays for the detection of 
HOMPS polynucleotides in a biological sample, such as breast or uterine tissue, 
serum, bone, prostate, and other tissues, urine, semen, cell preparations, and the 
like. Detectable HOMPS polynucleotides include, for example, a HOMPS gene or 
5 fragments thereof," HOMPS rnRNA, alternative splice variant HOMPS mRNAs, 
and recombinant DNA or RNA molecules containing a HOMPS polynucleotide. A 
number of methods for amplifying and/or detecting the presence of HOMPS 
polynucleotides are well known in the art and may be employed in the practice of 
this aspect of the invention. 

10 In one embodiment, a method for detecting a HOMPS rnRNA in a 

biological sample comprises producing cDNA from the sample by reverse 
transcription using at least one primer, amplifying the cDNA so produced using a 
HOMPS polynucleotides as sense and antisense primers to amplify HOMPS 
cDNAs therein; and detecting the presence of the amplified HOMPS cDNA. 

15 Optionally, the sequence of the amplified HOMPS cDNA can be determined. In 
another embodiment, a method of detecting a HOMPS gene in a biological 
sample comprises first isolating genomic DNA from the sample; amplifying the 
isolated genomic DNA using HOMPS polynucleotides as sense and antisense 
primers to amplify the HOMPS gene therein; and detecting the presence of the 

20 amplified HOMPS gene. Any number of appropriate sense and antisense probe 
combinations may be designed from the nucleotide sequences provided for the 
HOMPS (e.g. H41 as shown in FIG. 6) and used for this purpose. 

The invention also provides assays for detecting the presence of a HOMPS 
protein in a tissue of other biological sample such as breast or uterine tissue, serum, 

25 bone, prostate, and other tissues, urine, semen, cell preparations, and the like. 
Methods for detecting a HOMPS protein are also well known and include, for 
example, immunoprecipitation, immunohistochemical analysis, Western Blot 
analysis, molecular binding assays, ELISA, ELIFA and the like. For example, in 
one embodiment, a method of detecting the presence of a HOMPS protein in a 

30 biological sample comprises first contacting the sample with a HOMPS antibody, 
a HOMPS-reactive fragment thereof, or a recombinant protein containing an 
antigen binding region of a HOMPS antibody; and then detecting the binding of 
HOMPS protein in the sample thereto. 
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Methods for identifying a cell that expresses HOMPS are also provided. In 
one embodiment, an assay for identifying a cell that expresses a HOMPS gene 
comprises detecting the presence of HOMPS mRNA in the celL Methods for the 
detection of particular mRNAs in cells are well known and include, for example, 
5 hybridization assays using complementary DNA probes (such - as in situ 
hybridization using labeled HOMPS riboprobes, Northern blot and related 
techniques) and various nucleic acid amplification assays (such as RT-PCR using 
complementary primers specific for HOMPS, and other amplification type 
detection methods, such as, for example, branched DNA, SISBA, TMA and the 
10 like). Alternatively, an assay for identifying a cell that expresses a HOMPS gene 
comprises detecting the presence of HOMPS protein in the cell or secreted by the 
cell. Various methods for the detection of proteins are well known in the art and 
may be employed for the detection of HOMPS proteins and HOMPS expressing 
cells. 

15 HOMPS expression analysis may also be useful as a tool for identifying and 

evaluating agents that modulate HOMPS gene expression. Identification of a 
molecule or biological agent that could inhibit HOMPS expression or over- 
expression in cancer cells may be of therapeutic value. Such an agent may be 
identified by using a screen that quantifies HOMPS expression by RT-PCR, 

20 nucleic acid hybridization or antibody binding. 

MONITORING THE STATUS OF HOMPS 

Assays that evaluate the status of the HOMPS gene and HOMPS gene 
products in an individual may provide information on the growth or oncogenic 
potential of a biological sample from this individual. For example, because 

25 HOMPS are modulated by Her-2, an oncogene associated with a number of 
cancers, assays that evaluate the relative levels of HOMPS mRNA transcripts or 
proteins in a biological sample may be used to evaluate diseases associated with 
HOMPS disregulation such as cancer and may provide prognostic information 
useful in defining appropriate therapeutic options. 

30 Because HOMPS expression is modulated, for example, in cells which 

crerexpress the Her-2 oncogene, the expression status of HOMPS can provide 
information useful for deterrnining information including the presence, stage and 
location of displasic, precancerous and cancerous cells, predicting susceptibility to 
various stages of disease, and/or for gauging tumor aggressiveness. Consequendy, 
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an important aspect of the invention is directed to the various molecular methods 
for examining the status of HOMPS in biological samples such as those from 
individuals suffering from, or suspected of suffering from a pathology 
characterized by disregulated cellular growth such as cancer. 
5 Oncogenesis is known to be a mulristep process where cellular growth 

becomes progressively disregulated and cells progress from a normal 
physiological state to precancerous and then cancerous states (see e.g. Alers et al, 
Lab Invest. 77(5): 437-438 (1997) and Isaacs et al, Cancer Surv. 23: 19-32 
(1995)). In this context, examining a biological sample for evidence of 

10 disregulated cell growth can allow the early detection of such aberrant cellular 
physiology before a pathology such as cancer has progressed to a stage at which 
therapeutic options are more limited. In such examinations, the status of 
HOMPS in a biological sample of interest (such as one suspected of having 
disregulated cell growth) can be compared, for example, to the status of HOMPS 

15 in a corresponding normal sample (e.g. a sample from that individual (or 
alternatively another individual) that is not effected by a pathology, for example 
one not suspected of having disregulated cell growth) with alterations in the 
status of HOMPS in the biological sample of interest (as compared to the normal 
sample) providing evidence of disregulated cellular growth. In addition to using a 

20 biological sample that is not effected by a pathology as a normal sample, one can 
also use a predetermined normative value such as a predetermined normal level 
of mRNA expression (see e.g. Grever et al., J. Comp. Neurol 1996 Dec 
9;376(2):306-14 and U.S. patent No. 5,837,501) to compare HOMPS in normal 
versus suspect samples. 

25 The term "status" in this context is used according to its art accepted 

meaning and refers to the condition or state of a gene and its products. As 
specifically described herein, the status of HOMPS can be evaluated by a number 
of parameters known in the art. Typically an alteration in the status of HOMPS 
comprises a change in the location of HOMPS expressing cells (as occurs in 

30 metastases) and/or an increase in HOMPS mRNA and/or protein expression. 

Typically, skilled artisans use a number of parameters to evaluate the 
condition or state of a gene and its products. These include, but are not limited to 
the location of expressed gene products (including the location of HOMPS 
expressing cells) as well as the, level, and biological activity of expressed gene 
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products (such as HOMPS mRNA polynucleotides and polypeptides). Alterations 
in the status of HOMPS can be evaluated by a wide variety of methodologies well 
known in the art, typically those discussed below. Typically an alteration in the 
status of HOMPS comprises a change in the location of HOMPS and/or 
5 HOMPS expressing cells and/or an increase in HOMPS mRNA and/or protein 
expression. 

As discussed in detail herein, in order to identify a condition or 
phenomenon associated with disregulated cell growth, the status of HOMPS in a 
biological sample may be evaluated by a number of methods utilized by skilled 

10 artisans including, but not limited to genomic Southern analysis (to examine, for 
example perturbations in the HOMPS gene), northerns and/or PGR analysis of 
HOMPS mRNA (to examine, for example alterations in the polynucleotide 
sequences or expression levels of HOMPS mRNAs), and western and/or 
immunohistochemical analysis (to examine, for example alterations in 

15 polypeptide sequences, alterations in polypeptide localization within a sample, 
alterations in expression levels of HOMPS proteins and/or associations of 
HOMPS proteins with polypeptide binding partners). Detectable HOMPS 
polynucleotides include, for example, a HOMPS gene or fragments thereof, 
HOMPS mRNA, alternative splice variants HOMPS mRNAs, and recombinant 

20 DNA or RNA molecules containing a HOMPS polynucleotide. 

The expression profile of HOMPS makes them potential markers for 
disregulated cell growth. In particular, the status of HOMPS may provide 
information useful for predicting susceptibility to particular disease stages, 
progression, and/or tumor aggressiveness. The invention provides methods and 

25 assays for determining HOMPS status and diagnosing cancers that express 
HOMPS. HOMPS status in patient samples may be analyzed by a number of 
means well known in the art, including without limitation, immunohistochemical 
analysis, in situ hybridization, RT-PCR analysis on laser capture micro-dissected 
samples, western blot analysis of clinical samples and cell lines, and tissue array 

30 analysis. Typical protocols for evaluating the status of the HOMPS gene and gene 
products can be found, for example in Ausubul et aL eds., 1995, Current Protocols 
In Molecular Biology, Units 2 [Northern Blotting], 4 [Southern Blotting], 15 
[Immunoblotting] and 18 [PCR Analysis]. 
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As described above, the status of HOMPS in a biological sample can be 
examined by a number of well known procedures in the art. For example, the 
status of HOMPS in a biological sample taken from a specific location in the 
body can be examined by evaluating the sample for the presence or absence of 
5 HOMPS expressing cells (e.g. those that express HOMPS mRNAs or proteins). 
This examination can provide evidence of disregulated cellular growth for 
example, when HOMPS expressing cells are found in a biological sample that 
does not normally contain such cells (such as a lymph node). Such alterations in 
the status of HOMPS in a biological sample are often associated with disregulated 

10 cellular growth. Specifically, one indicator of disregulated cellular growth is the 
metastases of cancer cells from an organ of origin (such as the breast) to a 
different area of the body (such as a lymph node). In this context, evidence of 
disregulated cellular growth is important for example because occult lymph node 
metastases can be detected in a substantial proportion of patients with cancers, 

15 and such metastases are associated with known predictors of disease progression 
(see e.g. Freeman et al., J Urol 1995 Aug;154(2 Pt l):474-8). 

HOMPS ANTIBODIES 

The term "antibody" is used in the broadest sense and specifically covers 
single anti-HOMPS monoclonal antibodies (including agonist, antagonist and 

20 neutralizing antibodies) and anti-HOMPS antibody compositions with polyepitopic 
specificity. The term "monoclonal antibody" (mAb) as used herein refers to an 
. antibody obtained from a population of substantially homogeneous antibodies, i.e. 
the antibodies comprising the individual population are identical except for possible 
naturally-occurring mutations that may be present in minor amounts. 

25 Another aspect of the invention provides antibodies that bind to HOMPS 

proteins and polypeptides. The most preferred antibodies will specifically bind to a 
HOMPS protein and will not bind (or will bind weakly) to non-HOMPS proteins 
and polypeptides. Anti-HOMPS antibodies that are particulady contemplated 
include monoclonal and polyclonal antibodies as well as fragments containing the 

30 antigen binding domain and/or one or more complementarity determining regions 
of these antibodies. As used herein, an antibody fragment is defined as at least a 
portion of the variable region of the immunoglobulin molecule that binds to its 
target, i.e., the antigen binding region. 



57 



WO 01/25250 



PCT/US00/27649 



HOMPS antibodies of the invention may be particularly useful in imaging 
methodologies. Intracellularly expressed antibodies (e.g., single chain antibodies) 
may be therapeutically useful in treating cancers in which the expression of 
HOMPS is involved, such as for example Her-2 overexpressing cancers. Such 
5 antibodies may- be useful in the analysis, treatment, evaluation and 
characterization, and/or prognosis of other cancers, to the extent HOMPS is also 
expressed or overexpressed in other types of cancers. 

The invention also provides various immunological assays useful for the 
detection and quantification of HOMPS and mutant HOMPS proteins and 

10 polypeptides. Such assays generally comprise one or more HOMPS antibodies 
capable of recognizing and binding a HOMPS or mutant HOMPS protein, as 
appropriate, and may be performed within various immunological assay formats 
well known in the . art, including but not limited to various types of 
radioimmunoassays, enzyme-linked immunosorbent assays (ELISA), enzyme-linked 

15 immunofluorescent assays (ELIFA), and the like. In addition, immunological 
imaging methods capable of detecting cancers expressing HOMPS are also 
provided by the invention, including but limited to radioscintigraphic imaging 
methods using labeled HOMPS antibodies. Such assays may be clinically useful in 
the detection, monitoring, and prognosis of HOMPS expressing cancers. 

20 HOMPS antibodies may also be used in methods for purifying HOMPS 

and mutant HOMPS proteins and polypeptides and for isolating HOMPS 
homologues and related molecules. For example, in one embodiment, the method 
of purifying a HOMPS protein comprises incubating a HOMPS antibody, which 
has been coupled to a solid matrix, with a lysate or other solution containing 

25 HOMPS under conditions that permit the HOMPS antibody to bind to HOMPS; 
washing the solid matrix to eliminate impurities; and eiuting the HOMPS from the 
coupled antibody. Other uses of the HOMPS antibodies of the invention include 
generating anti-idiotypic antibodies that mimic the HOMPS protein. 

Various methods for the preparation of antibodies are well known in the 

30 art. For example, antibodies may be prepared by immunizing a suitable mammalian 
host using a HOMPS protein, peptide, or fragment, in isolated or 
immunoconjugated form (Harlow, and Lane, eds., 1988, Antibodies: A Laboratory 
Manual, CSH Press; Harlow, 1989, Antibodies, Cold Spring Harbor Press, NY). In 
addition, fusion proteins of HOMPS may also be used, such as a HOMPS GST- 
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fusion protein. In a particular illustrative embodiment, a GST fusion protein 
comprising all or most of the open reading frame amino acid sequence of H41 as 
shown in FIG. 6 may be produced and used as an immunogen to generate 
appropriate antibodies. In another embodiment, a HOMPS peptide may be 
5 synthesized and used as an immunogen. 

In addition, naked DNA immunization techniques known in the art may be 
used (with or without purified HOMPS protein or HOMPS expressing cells) to 
generate an immune response to the encoded immunogen (for review, see Donnelly 
etal., 1997, Ann. Rev. Immunol. 15:617-648). 

10 In an illustrative embodiment, the amino acid sequence of the H41 

HOMPS protein as shown in FIG 6 may be used to select specific regions of the 
HOMPS protein for generating antibodies. For example, hydrophobicity and 
hydrophilicity analyses of the HOMPS amino acid sequence may be used to identify 
hydrophilic regions in the HOMPS structure. Regions of the HOMPS protein that 

15 show immunogenic structure, as well as other regions and domains, can readily be 
identified using various other methods known in the art, such as Chou-Fasman, 
Garnier-Robson, Kyte-Doolittle, Eisenberg, I^arplus-Schultz or Jameson-Wolf 
analysis. 

Methods for preparing a protein or polypeptide for use as an immunogen 
20 and for preparing immunogenic conjugates of a protein with a carrier such as BSA, 
KLH, or other carrier proteins are well known in the art In some circumstances, 
direct conjugation using, for example, carbpdiimide reagents may be used; in other 
instances linking reagents such as those supplied by Pierce Chemical Co., Rockford, 
IL, may be effective. Adrninistration of a HOMPS immunogen is conducted 
25 generally by injection over a suitable time period and with use of a suitable adjuvant, 
as is generally understood in the art During the immunization schedule, titers of 
antibodies can be taken to determine adequacy of antibody formation. 

HOMPS monoclonal antibodies are preferred and may be produced by 
various means well known in the art For example, immortalized cell lines that 
30 secrete a desired monoclonal antibody may be prepared using the standard 
hybridoma technology of Kohler and Milstein or modifications that immortalize 
producing B cells, as is generally known. The immortalized cell lines secreting the 
desired antibodies are screened by immunoassay in which the antigen is the 
HOMPS protein or a HOMPS fragment. When the appropriate immortalized cell 
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culture secreting the desired antibody is identified, the cells may be expanded and 
antibodies produced either from in vitro cultures or from ascites fluid. 

The antibodies or fragments may also be produced, using current 
technology, by recombinant means. Regions that bind specifically to the desired 
5 regions of the HOMPS protein can also be produced in the context of chimeric or 
CDR grafted antibodies of multiple species origin. Humanized or human HOMPS 
antibodies may also be produced and are preferred for use in therapeutic contexts. 
Methods for humanizing murine and other non-human antibodies by substituting 
one or more of the non-human antibody CDRs for corresponding human antibody 

10 sequences are well known (see for example, Jones et ai., 1986, Nature 321:522-525; 
Riechmann et al., 1988, Nature 332:323-327; Verhoeyen et al., 1988, Science 
239:1534-1536). See also, Carter et al., 1993, Proc. Natl Acad. Sci. USA 89:4285 
and Sims et al., 1993, J. Immunol. 151:2296. Methods for producing fully human 
monoclonal antibodies include phage display and transgenic methods (for review, 

15 see Vaughan et aL, 1998, Nature Biotechnology 16:535-539). 

Fully human HOMPS monoclonal antibodies may be generated using 
cloning technologies employing large human Ig gene combinatorial libraries (Le., 
phage display) (Griffiths and Hoogenboom, Building an in vitro immune system: 
human antibodies from phage display libraries. In: Clark, M., ecL, 1993, Protein 

20 Engineering of Antibody Molecules for Prophylactic and Therapeutic Applications 
in Man, Nottingham Academic, pp 45-64; Burton and Barbas, Human Antibodies 
from combinatorial libraries. Id., pp 65-82). Fully human HOMPS monoclonal 
antibodies may also be produced using transgenic mice engineered to contain 
human immunoglobulin gene loci as described in PCT Patent Application 

25 W098/24893, Kucherlapati and Jakobovits et aL, published December 3, 1997 (see 
also, Jakobovits, 1998, Exp. Opin. Invest Drugs 7(4):607-614). This method avoids 
the in vitro manipulation required with phage display technology and efficiently 
produces high affinity authentic human antibodies. 3 

Reactivity of HOMPS antibodies with a HOMPS protein may be 

30 established by a number of well known means, including western blot, 
uumunoprecipitation, ELISA, and FACS analyses using, as appropriate, HOMPS 
proteins, peptides, HOMPS-expressing cells or extracts thereof. 

A HOMPS antibody or fragment thereof of the invention may be labeled 
with a detectable marker or conjugated to a second molecule. Suitable detectable 
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markers include, but are not limited to, a radioisotope, a fluorescent compound, a 
bioluminescent compound, chemiluminescent compound, a metal chelator or an 
enzyme. A second molecule for conjugation to the HOMPS antibody can be 
selected in accordance with the intended use. For example, for therapeutic use, 
5 the second molecule can be a toxin or therapeutic agent. Further, bi-specific 
antibodies specific for two or more HOMPS epitopes may be generated using 
methods generally known in the art. Homodimeric antibodies may also be 
generated by cross-linking techniques known in the art (e.g., Wolff et al., 1993, 
Cancer Res. 53: 2560-2565). 

10 HOMPS TRANSGENIC ANIMALS 

Nucleic acids that encode HOMPS or its modified forms can also be 
used to generate either transgenic animals or "knock out" animals which, in turn, 
are useful in the development and screening of therapeutically useful reagents. A 
transgenic animal (e.g., a mouse or rat) is an animal having cells that contain a 

15 transgene, which transgene was introduced into the animal or an ancestor of the 
animal at a prenatal, e.g., an embryonic stage. A transgene is a DNA that is 
integrated into the genome of a cell from which a transgenic animal develops. In 
one embodiment, cDNA encoding HOMPS can be used to clone genomic DNA 
encoding HOMPS in accordance with established techniques and the genomic 

20 sequences used to generate transgenic animals that contain cells that express 
DNA encoding HOMPS. Methods for generating transgenic animals, particularly 
animals such as mice or rats, have become conventional in the art and are 
described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009. Typically, 
particular cells would be targeted for HOMPS transgene incorporation with 

25 tissue-specific enhancers. Transgenic animals that include a copy of a transgene 
encoding HOMPS introduced into the germ line of the animal at an embryonic 
stage can be used to examine the effect of increased expression of DNA 
encoding HOMPS. Such animals can be used as tester animals for reagents 
thought to confer protection from, for example, pathological conditions 

30 associated with its overexpression. In accordance with this facet of the 
invention, an animal is treated with the reagent and a reduced incidence of the 
pathological condition, compared to untreated animals bearing the transgene, 
would indicate a potential therapeutic intervention for the pathological condition. 
Alternatively, non-human homologues of HOMPS can be used to 
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construct a HOMPS "knock out" animal that has a defective or altered gene 
encoding HOMPS as a result of homologous recombination between the 
endogenous gene encoding HOMPS and altered genomic DNA encoding 
HOMPS introduced into an embryonic cell of the animal For example, cDNA 
5 encoding HOMPS can be used to clone genomic DNA encoding HOMPS in 
accordance with established techniques. A portion of the genomic DNA 
encoding HOMPS can be deleted or replaced with another gene, such as a gene 
encoding a selectable marker that can be used to monitor integration. Typically, 
several kilobases of unaltered flanking DNA (both at the 5' and 3' ends) are 

10 included in the vector (see e.g., Thomas and Capecchi, 1987, Cell 51:503) for a 
description of homologous recombination vectors]. The vector is introduced 
into an embryonic stem cell line (e.g., by electroporation) and cells in which the 
introduced DNA has homologously recombined with the endogenous DNA are 
selected (see e.g., Li et al., 1992, Cell 69:915). The selected cells are then injected 

15 into a blastocyst of an animal (e.g., a mouse or rat) to form aggregation chimeras 
(see e.g., Bradley, in Robertson, ed., 1987, Teratocarcinomas and Embryonic 
Stem Cells: A Practical Approach, (IRL, Oxford), pp. 113-152). A chimeric 
embryo can then be implanted into a suitable pseudopregnant female foster 
animal and the embryo brought to term to create a "knock out" animal. Progeny 

20 harboring the homologously recombined DNA in their germ cells can be 
identified by standard techniques and used to breed animals in which all cells of 
the animal contain the homologously recombined DNA. Knockout animals can 
be characterized for instance, for their ability to defend against certain 
pathological conditions and for their development of pathological conditions due 

25 to absence of the HOMPS polypeptide. 

IDENTIFYING MOLECULES THAT INTERACT WITH HOMPS 

The HOMPS protein sequences disclosed herein allow the skilled artisan 
to identify molecules that interact with them via any one of a variety of art 
accepted protocols. For example one can utilize one of the variety of so-called 
30 interaction trap systems (also referred to as the "two-hybrid assay"). In such 
systems, molecules that interact reconstitute a transcription factor and direct 
expression of a reporter gene, the expression of which is then assayed. Typical 
systems identify protein-protein interactions in vivo through reconstitution of a 
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eukaryotic transcriptional activator and are disclosed for example in U.S. Patent 
Nos. 5,955,280, 5,925,523, 5,846,722 and 6,004,746. 

Alternatively one can identify molecules that interact with HOMPS 
protein sequences by screening peptide libraries. In such methods, peptides that 
5 bind to selected receptor molecules such as HOMPS are identified by screening 
libraries that encode a random or controlled collection of amino acids. Peptides 
encoded by the libraries are expressed as fusion proteins of bacteriophage coat 
proteins, and bacteriophage particles are then screened against the receptors of 
interest. Peptides having a wide variety of uses, such as therapeutic or diagnostic 

10 reagents, may thus be identified without any prior information on the structure of 
the expected ligand or receptor molecule. Typical peptide libraries and screening 
methods that can be used to identify molecules that interact with HOMPS protein 
sequences are disclosed for example in U.S. Patent Nos. 5,723,286 and 5,733,731. 
Alternatively, cell lines expressing HOMPS can be used to identify 

15 protein-protein interactions mediated by HOMPS. This possibility can be 
examined using immunoprecipitation techniques as shown by others (Hamilton, 
B.J., et al, 1999, Biochem. Biophys. Res. Commun. 261:646-51). Typically 
HOMPS protein can be immunoprecipitated from HOMPS expressing cancer 
cell lines using anti-HOMPS antibodies. Alternatively, antibodies against His-tag 

20 can be used in cell line engineered to express HOMPS (vectors mentioned 
above). The immunoprecipitated complex can be examined for protein 
association by procedures such as western blotting, 35 S-methionine labeling of 
proteins, protein microsequencing, silver staining and two dimensional gel 
electrophoresis. 

25 Related embodiments of such screening assays include methods for 

identifying small ' molecules that interact with HOMPS. Typical methods are 
discussed for example in U.S. Patent No. 5,928,868 and include methods for 
forming hybrid ligands in which at least one ligand is a small molecule. In an 
illustrative embodiments, the hybrid ligand is introduced into cells that in turn 

30 contain a first and a second expression vector. Each expression vector includes 
DNA for expressing a hybrid protein that encodes a target protein linked to a 
coding sequence for a transcriptional module. The cells further contains a 
reporter gene, the expression of which is conditioned on the proximity of the 
first and second hybrid proteins to each other, an event that occurs only if the 
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hybrid ligand binds to target sites on both hybrid proteins. Those cells that 
express the reporter gerie are selected and the unknown small molecule or the 
unknown hybrid protein is identified. 

Methods which may also be used to quantitate the expression of HOMPS 
5 include radiolabeling or biotinylating nucleotides, coamplification of a control 
nucleic acid, and standard curves onto which the experimental results are 
interpolated (Melby, P. C. et al. (1993) J. Immunol. Methods, 159:235-244; 
Duplaa, C. et al. (1993) Anal Biochem. 229-236). The speed of quantitation of 
multiple samples may be accelerated by running the assay in an ELISA format 

10 where the oligomer of interest is presented in various dilutions and a 
spectrophotometric or colorimetric response gives rapid quantitation. 

In another embodiment of the invention, the nucleic acid sequences 
which encode HOMPS may also be used to generate hybridization probes which 
are useful for mapping the naturally occurring genomic sequence. The sequences 

15 may be mapped to a particular chromosome or to a specific region of the 
chromosome using well known techniques. Such techniques include FISH, 
FACS, or artificial chromosome constructions, such as yeast artificial 
chromosomes, bacterial artificial chromosomes, bacterial PI constructions or 
single chromosome cDNA libraries as reviewed in Price, C. M. (1993) Blood Rev. 

20 7:127-134, and Trask, B. J. (1991) Trends Genet. 7:149-154. FISH (as described 
in Verma et al. (1988) Human Chromosomes: A Manual of Basic Techniques, 
Pergamon Press, New York, N.Y.) may be correlated with other physical 
chromosome mapping techniques and genetic map data. Examples of genetic 
map data can be found in the 1994 Genome Issue of Science (265:1981 f). 

25 Correlation between the location of the gene encoding HOMPS on a 

physical chromosomal map and a specific disease, or predisposition to a specific 
disease, may help delimit the region of DNA associated with that genetic disease. 
The nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier, or affected individuals, 

30 In situ hybridi2ation of chromosomal preparations and physical mapping 

techniques such as linkage analysis using established chromosomal markers may 
be used for extending genetic maps. Often the placement of a gene on the 
chromosome of another mammalian species, such as mouse, may reveal 
associated markers even if the number or arm of a particular human 
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chromosome is not known. New sequences can be assigned to chromosomal 
arms, or parts thereof, by physical mapping. This provides valuable information 
to investigators searching for disease genes using positional cloning or other gene 
discovery techniques. Once the disease or syndrome has been crudely localized 
5 by genetic linkage to a particular genomic region, for example, AT to tlq22-23 
(Gatti, R. A. et al. 

(1988) Nature 336:577-580), any sequences mapping to that area may represent 
associated or regulatory genes for further investigation. The nucleotide sequence 
of the subject invention may also be used to detect differences in the 

10 chromosomal location due to translocation, inversion, etc. among normal, carrier, 
or affected individuals. 

In another embodiment of the invention, HOMPS, its catalytic or 
immunogenic fragments or oligopeptides thereof, can be used for screening 
libraries of compounds in any of a variety of drug screening techniques. The 

15 fragment employed in such screening may be free in solution, affixed to a solid 
support, borne on a cell surface, or located intxacellularly. The formation of 
binding complexes, between HOMPS and the agent being tested, may be 
measured. 

Another technique for drug screening which may be used provides for 
20 high throughput screening of compounds having suitable binding affinity to the 
protein of interest as described in published PCT application WO84/03564. In 
this method, as applied to HOMPS large numbers of different small test 
compounds are synthesized" "on a solid substrate, such as plastic pins or some 
other surface. The test compounds are reacted with HOMPS, or fragments 
25 thereof, and washed. Bound HOMPS is then detected by methods well known in 
the art Purified HOMPS can also be coated direcdy onto plates for use in the 
aforementioned drug screening techniques. Alternatively, non-neutralizing 
antibodies can be used to capture the peptide and immobilize it on a solid 
support. 

30 In another embodiment, one may use competitive drug screening assays 

in which neutralizing antibodies capable of binding HOMPS specifically compete 
with a test compound for binding HOMPS. In this manner, the antibodies can be 
used to detect the presence of any peptide which shares one or more antigenic 
determinants with HOMPS. 
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In additional embodiments, the nucleotide sequences which encode 
HOMPS may be used in any molecular biology techniques that have yet to be 
developed, provided the new techniques rely on properties of nucleotide 
sequences that are currendy known, including, but not limited to, such properties 
5 as the triplet genetic code and specific base pair interactions. 

KITS 

For use in the applications described or suggested above, kits are also 
provided by the invention. Such kits may comprise a carrier means being 
compartmentalized to receive in close confinement one or more container means 
10 such as vials, tubes, and the like, each of the container means comprising one of 
the separate elements to be used in the method. For example, one of the 
container means may comprise a probe that is or can be detectably labeled. Such 
probe may be an antibody or polynucleotide specific for a HOMPS protein or a 
HOMPS gene or message, respectively. Where the kit utilizes nucleic acid 
15 hybridization to detect the target nucleic acid, the kit may also have containers 
containing nucleotide(s) for amplification of the target nucleic acid sequence 
and/or a container comprising a reporter-means, such as a biotin-binding 
protein, such as avidin or streptavidin, bound to a reporter molecule, such as an 
enzymatic, florescent, or radioisotope label. 
20 The kit of the invention will typically comprise the container described 

above and one or more other containers comprising materials desirable from 
a commercial and user standpoint, including buffers, diluents, filters, needles, 
syringes, and package inserts with instructions for use. A label may be present on 
the container to indicate that the composition is used for a specific therapy or non- 
25 therapeutic application, and may also indicate directions for either in vivo or in vitro 
use, such as those described above. 

EXAMPLES 

Various aspects of the invention are further described and illustrated by 
way of the several examples that follow, none . of which are intended to limit the 
30 scope of the invention. 

1. Cell Culture 
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Cells were grown in RPMI medium 1640, supplemented with 10 % fetal 
bovine serum, 2mM glutamine, and 1% penicillin G-streptomycin- fungizone 
solution. Cells were harvested at 80 % confluency for total RNA extraction. 

5 2, RNA Preparation 

Total cellular RNA was purified by guanidinium / cesium chloride 
ultracentrifugation (16). Messenger RNA was isolated by two passages through 
an oligo dT cellulose column (T3 - Collaborative Research) (17). The quality and 
mRNA composition of the resulting RNA population were confirmed by 
10 Northern blot analysis by probing with P-actin and HER-2 cDNAs. Both MCF- 
7/control and MCF-7/HER2 mRNA pools contain equivalent, basal expression 
of the endogenous HER-2 transcript whereas the transcript representing 
transfected HER-2 cDNA was present only in the MCF-7/HER2 cells. 

15 3, Construction of the cDNA Library 

Five ug of MCF-7/HER2 poly (A) + RNA was constructed into ZAP 
Express™ vector (Stratagene) according to the manufacturer's protocol using 
materials provided in the cDNA synthesis kit. The recombinant phage were 
packaged in Gigapack II Gold packaging extract (Stratagene). The packaged 
20 library was amplified one round through passage on XLl-Blue MRP host cells 
(6x1 0 5 pfu/nl titer). As determined by the X-gal/IPTG color assay, the 
background (non-recombinant) phage level was less than 0. 1 %. 

4. Differential hybridization 

25 The MCF-7/HER2 cDNA library was plated on XLl-Blue MRF host 

cells at a density of 2,000 pfu per each of eight 150 mm petri dishes. After 
plating, actin and HER-2 clones purified from the same library were each loaded 
onto four designated spots within the individual plates to be used as 
hybridization controls. The nitrocellulose filters (Millipore) were placed on the 

30 agar plates 1.5 min. for the first filter, 3 min. for the second, and 7 min. for the 
third. The phage DNA was denatured for 3 min. in a solution containing 0.5 M 
NaOH 1.5 M NaCl, neutralized for 3 min. in a solution containing 3 M NaCl 0.5 
M Tris pH 7.5, and rinsed in 2xSSC The treated filters were air-dried and baked 
at 80°C for 1 hour. 
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The radiolabeled cDNA probes (MCF7 /control and MCF-7/HER2) 
were prepared as follows. Poly (A) + RNA was randomly labeled, by using both 
random hexamers and oligo dT primers, in 20 ui solution containing 1.0 jag poly 
(A) + RNA, Ix MMLV buffer, 1 mM each of dATP, dGTP, dTTP and 0.045 mM 
5 dCTP, 100 LiCi y[ 32 P]dCTP, 0.5 ug oligo dT( 15 ), 0.2 fig random primer, 20 U 

RNase inhibitor, and 200 U Moloney murine leukemia virus (MMLV) reverse 
transcriptase. The reaction was incubated at room temperature first for 10 min. 
to allow primer annealing and further incubated at 37°C for 1 hour. Upon 
addition of 4.6 fil of 0.5 M NaOH, the reaction mix was incubated at 70°C for 20 

10 min. Incorporated counts were eluted from a spin column (Chromaspin 100 / 
Clontech) in 1XTEN buffer (0.1 M NaCl, 10 mM Tris-pH 8.0, 1 mM EDTA). 

Approximately 5.9x1 0 7 dpm counts of MCF-7/control probes were 
added to a first hybridization containing 26 ml hybridization solution and the first 
set of 8 niters obtained from each plate. Equal counts of MCF-7/HER2 probes 

15 were added to the second set of filters. The third set of filters were hybridized 
with radioactive HER-2 cDNA in order to avoid selecting the HER-2 containing 
clones. Hybridization solution contained 50% formamide, 25X Denhardts, 
sonicated salmon sperm DNA, NaPO^pH 6.8, sodium pyrophosphate, and ribo 
ATP. Prehybridization was performed at 42°C for 4 h. and hybridization at 42°C 

20 for 4 days (overnight hybridization for the third set of filters). Filters were 
washed at room temperature for 5 min. (x3) in 0.2xSSC/0.1%SDS, and at 60°C 
for 15 min. (x7) in the same solution. The washed filters were exposed with an 
intensifying screen to Kodak-XAR5 films at -70°C for various time periods. 
Autoradiograms were analyzed to compare differences in focal signal intensity 

25 between the replica filters. 

For the secondary screenings, the primary screening procedure was 
repeated except that each clone was separately plated onto a 100 mm petri dish at 
a low density of 25-50 plaques/plate . 

30 5. Probe generation for Northern hybridization 

The pBK-CMV phagemid was in vivo excised from the lamdaphage 
vector according to the manufacturer's instructions (Stiatagene). The cDNA 
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inserts were isolated from the plasmid either by restriction enzyme digestions or 
by PCR amplification using T3 and 17 sequences as primers. 

6. Northern Hybridization 

.5 Either 2 fig of poly (A) + RNA or 20 fig of total RNA was loaded onto a 

1% formaldehyde agarose gel and electrophoresed at 70 V for 4 hours. The 
RNA was transferred to a nylon membrane in lOx SSC. The purified cDNA 
inserts were random-labeled in a 50 fll reaction mix which contained 50 ng 
template, [y- 32 P] dCTP, 20 jig BSA, 6 U Klenow. Incorporated counts were 

10 eluted from a G-50 Sephadex spin column (Pharmacia). Approximately 3x10^ 
dpm counts per 1 ml hybridization solution were used. The hybridization was 
carried out in 50% formamide, 2xSSC, 0.1% SDS, 10 mg/ml salmon sperm 
DNA,'and 10% dextran sulfate, at 42°C for 16 hours. Membranes were washed 
in 2xSSC/0.1%SDS at 25°C for 10 min. (x3), and in the same . solution at 65°C 

15 for 5 min. (x2). The washed membranes were exposed with an intensifying 
screen to Kodak-XAR film at -70°C. 

7. DNA Sequencing and Computer Analysis 

Minipreparations of pBK-CMV plasmid vector (Qiawell 8 Ultra, Qiagen) 
20 were sequenced with T3/T7 promoter primers and internal primers using an 
automatic DNA sequencer (Applied Biosystems Model 373A). The sequence 
similarity search was performed using GenBank and EMBL DNA databases. 

8. In Vitro Transcription / Translation 

25 The cDNA inserts were translated into polypeptides in a TNT coupled 

reticulocyte system (Promega) according to the manufacturer's protocol ; 
Approximately lflg of purified plasmid template was transcribed and translated in 
the 50 |il reaction containing T3 RNA polymerase, rabbit reticulocyte lysate, 
[ 35 S] methionine, etc. 5 fil of the end product was aliquoted to estimate the 

30 molecular size of the in vitro translated protein using 10% SDS-PAGE and 
prestained protein size markers (Bio-Rad). 

9. Extraction of total RNA from breast tumor samples 
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Breast tumors were obtained from patients at the time of surgery as part 
of a core tissue procurement resource sponsored by the DOD breast cancer 
program. All tumor samples were snap frozen in liquid nitrogen and kept at - 
70° C before extraction of RNA. Frozen tissues were pulverized in liquid 
5 nitrogen prior to homogenization in cold 4 M guanidine thiocyanate buffer (7.5 
ml/g of tissue). The homogenates were centrifuged for 10 min. at 4°C at 8000 g 
in order to remove cell debris. RNA was sedimented through a cesium chloride 
gradient (5.7 M/2.4M CsCl2) via ultracentrifugation (18 h at 36,000 rpm, 20°C). 
The separated RNA phases were extracted with phenol-chloroform prior to wash 
10 with 100 % ethanol. The RNA pellet was precipitated by adding 2 ml of 0.4 M 
sodium acetate and 2.5 vol. of 100 % ethanol, and storing over night at -20°C. 
After ccntrifugation (20 min at 10,000 g), the pellets were dried and dissolved in 
DEPC water. 

15 10. Isolation of Differentially Expressed Genes Associated with HER- 
2/ neu Over expression 

In our first round of differential screening, 16,000 clones from the MCF- 
7/HER-2 library were analyzed. Clones showing a stronger signal intensity 
hybridized with the MCF-7/control cell cDNA probes were labeled "C" clones 

20 (CI, C2, C3, etc.), whereas those demonstrating a stronger signal intensity 
hybridized with HER-2 overexpressing cell cDNA probes were labeled "H" 
clones (HI, H2, H3, etc.). From this primary screening, a total of 127 
differentially expressed clones were isolated including 77 C clones and 50 H 
clones representing genes whose expression levels are decreased (C clones) or 

25 increased (H clones) respectively in association with HER-2 overexpression. 
Each clone was ranked according to degree of differential hybridization based on 
signal intensity ranging from more than a five fold to less than a two fold change 
based on visualization. 

Forty- three C clones and 36 H clones which demonstrated the greatest 

30 differences in signal intensity in the primary screening were taken through 
secondary screening to ensure consistent differential expression and to isolate 
pure colonies. Subsequent to this isolation, the clones were cross-hybridized to 
determine redundancy. This resulted in a total of 7 non-redundant C clones and 
12 non-redundant H clones. Finally, to confirm our screening technique, 
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differential expression patterns of the selected clones were evaluated by Northern 
blot analysis of RNA from MCF-7/control and HER-2 cells. A total of 5 C 
clones and 1 1 H clones showed expression patterns consistent with expectations 
from the differential hybridization approach while 2 C clones and 1 H clone 
5 failed to demonstrate the anticipated pattern. 

11. DNA Sequencing and Identification of the Clones 

Individual clones whose differential expression was confirmed by 
Northern blot analysis were subsequendy analyzed by DNA sequencing, and 

10 these data demonstrate that full-length cDNAs. were obtained for most of the 
clones. The differentially expressed genes were grouped into three different 
classes based on computer searches against the GenBank and EMBL data bases; 
(1) known genes with previously characterized function (2) previously identified 
genes with relatively uncharacterized function (3) novel sequences (summarized 

15 in Table 1 below). In addition, each clone was grouped into three different 
categories based on significance of their relative difference in expression (Table 
1). Even genes whose differential expression is small (~2 fold) were included if 
this difference was consistendy reproducible in multiple analyses. 

As stated above, the MCF-7/HER-2 cells behave significandy differendy 

20 than their isogenic control parental counterparts; with increases in DNA 
synthesis, cell growth in vitro, soft agar cloning efficiency and tumorigenicity. Nine 
of the differentially expressed genes identified in this study are known to be 
associated with the malignant phenotype. For example, cytokeratin 8 (C29) 
(Taniguchi, T., Fujii-Kuriyama, Y. and Muramatsu, M. (1980) Proc Natl Acad Sri U 

25 S A, 77(7), 4003-6), cytokeratin 18 (C49) (Oshima, R. G., Millan, J. L. and 
Cecena, G. (1986) Differentiation, 33(1), 61-8), and gamma actin (C72) (Erba, H. 
P., Gunning, P. and Kedes, L. (1986) Nucleic Acids Res, 14(13), 5275-94) are 
cytoskeletal proteins essential for mamtaining both cell shape and motility of 
normal cells and their expression differs from levels seen in the aberrant 

30 cytoskeleton of cancer cells (Mukhopadhyay, T. and Roth, J. A. (1996) Anticancer 
Res, 16(1), 105-12). Similarly GAPDH (H31) (Tokunaga, K., Nakamura, Y., 
Sakata, K., Fujimori, K., Ohkubo, M., Sawada, K. and Sakiyama, S. (1987) Cancer 
Res, 47(21), 5616-9) and succinyi CoA transferase (H45) (Kassovska-Bratinova, 
S., Fukao, T., Song, X. Q., Duncan, A. M., Chen, H. S., Robert, M. F., Perez- 
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Cerda, C, Ugarte, M., Chartrand, C, Vobecky, S., Kondo, N. and Mitchell, G. A. 
(1996) Am J Hum Genet, 59(3), 519-28) are involved in the metabolic pathway of 
more rapidly growing cells including cancer cells. The increased expression of 
ribosomal proteins, L8 (HI 6) (Hanes, J., Klaudiny, J., von der Kammer, H. and 
5 Scheit, K. H. (1993) Biochem Biophys Res Commun, 197(3), 1223-8) and LLrep3 
(H35) (Heller, D. L., Gianola, K. M. and Leinwand, L. A. (1988) Mot Cell Biol, 
8(7), 2797-803), is consistent with the increased rate of protein translation, 
required for cancer cell growth. Two other genes identified in our study whose 
cellular functions have been previously characterized are Cathepsin D (C31) 

10 (Faust, P. L., Kornfeld, S. and Chirgwin, J. M. (1985) Prvc Natl Acad Sci USA, 
82(15), 4910-4), an acidic lysosomal protease, and the 90 kDa heat shock protein 
(H18) (Rebbe, N. R, Ware, J., Bertina, R. M., Modrich, P. and Stafford, D. W. 
(1987) Gene, 53(2-3), 235-45) which is a chaperon protein associated with steroid 
hormone receptor genes. Both of these genes are known to be differentially 

15 expressed in malignant cells (Johnson, M. D., Torri, J. A., Lippman, M, E. and 
Dickson, R. B. (1993) Cancer Res, 53(4), 873-7; Mileo, A. M., Fanuele, M., 
Battaglia, F., Scambia, G., Benedetti-Panici, P., Mancuso, S. and Ferrini, U. (1990) 
Anticancer Res, 10(4), 903-6). 

Three genes (HI 3, HI 4, H37) found to be overexpressed in the HER-2 

20 overexpressing cells matched cDNA sequences which were previously identified 
by other investigators but not fully characterized. The HI 3 clone appears to be 
an alternate splice variant of DNA fragmentation factor (DFF) (GenBank 
accession no. U91985) (Liu, X., Zou, H., Slaughter, C. and Wang, X. (1997) Cell, 
89(2), 175-84); the first 261 amino acid sequences contained in both HI 3 and 

25 the DFF open reading frames are identical, but HI 3 lacks 70 amino acids at the 
3' end and contains 7 different amino acids in their place. The predictive arnino 
acid sequence for H13 is highly similar to the ICAD (Inhibitor of Caspase- 
Activated Dnase)- S and -L proteins (Sakahira, H., Enari, M. and Nagata, S. 
(1998) Nature, 391(6662), 96-9; Enari, M., Sakahira, H, Yokoyama, H., Okawa, 

30 IC, Iwamatsu, A. and Nagata, S. (1998) Nature, 391(6662), 43-50) (73% and 69% 
sequence identity, respectively). In order to ensure that the cDNAs cloned, 
either uncharacterized or novel, can be efficientiy translated into protein products 
of expected sizes, we performed in vitro - translation experiments. As predicted, 
the H13 cDNA was translated into the polypeptide of approximately 30 kDa 
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(Figure 13) Clone H14 is identical to DRP-1 (Density regulated protein-1 / 
GenBank accession no. AF038554) (Deyo, J. E., Chiao, P. J. and Tainsky, M. A. 
(1998) DNA Cell Biol, 17(5), 437-47) except that it lacks 285 bp at the 5* end and 
has 23 additional bp at the 3' end plus a poly (A) tail. Both the H 14 and DRP-1 
5 cDNAs are likely to be partial, 3* end sequences of a larger transcript since no 
suitable initiating codon was found in either sequences, and the HI 4 cDNA 
hybridized with an additional transcript of approximately 6.5 kb on Northern 
blot analysis. According to the EST (Expressed Sequence Tags) database 
analysis, the 5' end of the HI 4 cDNA sequence can be extended, and the ESTs 

10 covering the extended portion of the gene is designated THC202438 (deposited 
in the Tentative Human Consensus effort) (Kirkness, E. F. and Kerlavage, A. R. 
(1997) Methods Mol Biol, 69, 261-8). The H37 cDNA sequence has been 
previously deposited into GenBank as RNA binding motif protein 5 (RBM5) 
(accession no. AF091263, unpublished) found within a region reported to be 

15 homozygously deleted in lung cancer and believed to contain (a) major tumor 
suppressor gene(s) involved in a majority of small cell and non-small cell lung 
cancers (Wei, M. H., Latif, F., Bader, S., Kashuba, V., Chen, J. Y., Duh, F. M., 
Sekido, Y., Lee, C. C, Geil, L., Kuzmin, I., Zabarovsky, E., Klein, G., Zbar, B., 
Minna, J. D. and Lerman, M. I. (1996) Cancer Res, 56(7), 1487-92). The H37 

20 cDNA contains an open reading frame of 816 amino acid and is translated in vitro 
. into a predicted protein product of approximately 90 kDa (Figure 13). Analysis 
of the putative H37 protein against PROSITE protein profile databases 
recognized the presence of two RNA binding domains, located at amino acid 
residues 140-147 and 274-281, which are perfect matches with the consensus 

25 eukaryotic sequence for a putative RNA-binding region RNP-1 (Bandziulis, R. J., 
Swanson, M. S. and Dreyfuss, G. (1989) Genes Dev, 3(4), 431-7). 

Four clones (C40, HI 7, H41, H63) represented as-yet unknown genes in 
the DNA databases, and three of these (C40, H17, H41) were found to contain 
probable open reading frames. The 1750 bp long C40 clone contains a 510 

30 araino acid open reading frame (Figure 14A). The coding region of this gene 
begins with a start codon at nucleotide position 74 and has an in-frame stop 
codon at position 1604 (Figure 14A). The C40 clone was in vitro - translated into 
the predicted major protein product of approximately 55 kDa (Figure 13). The 
385 bp region (nt 568-952) of this gene is 89% identical to a GenBank transcript 
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(accession no. U56429), however,, the putative C40 protein does not share any 
significant homology to any known proteins in the databases. Examination of 
die C40 protein sequence using the GCG program Motifs revealed the presence 
of a leucine zipper motif at amino acid positions 104-125, which is a perfect 
5 match with the consensus sequence for the leucine zipper pattern (Steeg, P. S., 
Bevilacqua, G., Kopper, L., Thorgeirsson, U. P., Talmadge, J. E., Liotta, L. A. 
and Sobel, M. E. (1988) ] Natl Cancer Inst, 80(3), 200-4) (Figure 14A). 

The 1981 bp long HI 7 clone contains a consensus initiation codon 
(Kozak, M. (1991) J Biol Cbem y 266(30), 19867-70) at nucleotide 66 followed by a 

10 486 amino acid open reading frame and a 458 bp 3' untranslated region including 
a polyadenylation signal (AATAAA) (Figure 14B). As is known in the art, the 
nucleic acid sequence around the 5' proximal AUG codon is typically a Kozak 
consensus sequence where eukaryotic ribosomes initiate translation and the 
general rule the eukaryotic ribosomes initiate translation exclusively at the 5' 

15 proximal AUG codon is abrogated only under rare conditions (see e.g. Kozak 
PNAS 92(7): 2662-2666, (1995) and Kozak NAR 15(20): 8125-8148 (1987)). In 
vitro translation generated the predicted protein product of approximately 50 
kDa (Figure 13). The putative H17 protein has 39.3% identity with a C. elegans 
cDNA of unknown function (Z77667) over a 422 amino acid region (aa 61-482) 

20 and is similar to a metabolic enzyme sarcosine oxidase (AE001086) with 29.2% 
identity in a 171 amino acid region (aa 66-236) (Figure 14B). 

The 3346 bp H41 clone contains a 323 bp 5' untranslated region followed 
by an initiation codon with a Kozak consensus (Kozak, M. (1991) / Biol Chem, 
266(30), 19867-70) and an extensive, 2249 bp 3* untranslated region (Figure 14C). 

25 The 258 amino acid residues encoded by its open reading frame was translated in 
vitro into the predicted protein product of approximately 30 kDa and an 
additional protein of lower molecular weight (Figure 13). The putative H41 
protein is related to one of the "fast evolving" drosophila genes of unknown 
function (AF005858) (Schmid, K.J. and Tautz, D. (1997) Proc Natl Acad Sci U S 

30 A, 94(18), 9746-50) with 28.7% identity in a 167 amino acid region (aa 9-175) 
(Figure 14C). According to a pSORT protein database search, the H41 gene 
product is predicted to be a nuclear protein based on the presence of a nuclear 
localization signal, 4 basic amino acid (lysine) residues, at its N-terrninus (94.1% 
reliability by Reinhardt's method) (Reinhardt, A. and Hubbard, T. (1998) Nucleic 
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Acids Res, 26(9), 2230-6) (Figure 14C). For the above three novel cDNA 
sequences (C40, HI 7, H41), we could not find any ESTs which would extend our 
sequences furdier either at the 5' or 3' ends. 

Lasdy, the novel H63 clone is believed to be a partial, 3* sequence of a 
5 longer transcript because this sequence did not contain any probable open 
reading frames, and the 2068 bp DNA hybridized with a transcript of 
approximately 4.5 kb on a Northern blot. According to the EST database 
analysis, die 5' end of the H63 cDNA sequence can be extended and assembled 
as THC1 75350. 

10 Table 1: Identity of differentially expressed clones 



Clone #« 


Size* 
(bp) 


Clone Identity mRNA size* 

(kb) 


Relative difference 
in cxprcMiotr 1 




(1) Genes with previously characterized functiorT 






C29 


1777 


Keratin 8 


1.8 


U 


C31 


2053 


Cathepsin D 


2.1 


u 


C4*> 


1423 


Keratin 18 


1.4 


u 


C72 


1940 


Gamma aclin 


1.9 


I 


HUi 


933 


Ribosomal protein L8 


09 


t 


1118 


2530 


90-kDa heat-shock protein 


1.2, 2.5 


ttt 


1-13 1 


1284 


Glyccraldcbydc-3-phosphatc dehydrogenase 


1.3 


TT 


H35 


948 


LLRcp3 


0.9 


ttt 


H45 


2317 


Succinyl CoA:3-oxoacid CoA uansferasc 


1.5, 3.3, 5.3 


tt 


(2) Recently identified genes with relatively uncharactcrized function 






im 


1027 


DNA fragmentation factor (DFF) 


» 

1.0, 1.4, 3.4, 6.4 


tt 


NH 


2214 


Density Regulated Protein- 1 (DR1M) 


0.96. 2.2, 6.5 


ttt 


H37 


3091 


RNA binding motif protein 5 (RUM 5) 


1.9,3.1,6.5 


tt 


(3) Novel 


sequences 








C40 


1750 


Not previously identified 


1.8, Z6.4.9 


i 


HI7 


1981 


Not previously identified 


2.0, 4.4 


ttt 


H41 


3346 


Not previously identified 


1.9, 2.7, 3.3,4.0 


ttt 


1163 


2068 


Not previously identified 


1.9, 4.5 


t 



■Prefix "C" and "H" denotes genes whose expression level decrease and increase, respectively, in MCP-7/HER-2 cells 
c^iripared 
to the control cells. 

b *llic size of cDNAs cloned in the differential screening and used as probes for Northern blot analysis. 
•The sizes of differentially expressed transcripts on Northern autoradiograra 

^Directions of arrows indicate expression level increase (t) or decrease (I), respectively, in HER-2 overexposing cells. 
The number of arrows indicates relative difference in expression level change by visualization, i.e. t = -2 fold, tt = 
3-5 fold, ttt = >5 fold. 

•'llie nucleotide sequences reported in this study have been submitted to die GcnBank™/EMBL database and assigned 
these 

accession numbers. 
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12. Confirmation of Differential Expression in Ovarian Cancer Cell 
Counterparts 

5 To ensure that differential expression of these genes is a phenomenon 

consistently associated with HTLR-2/rzeu overexpression rather than a unique 
event restricted to a single cell line, we evaluated the expression of these genes in 
human ovarian cancer cells (CaOv-3) engineered to overexpress HER-2/ neu in a 
fashion identical to the MCF-7/HER-2 cells (Pietras, R. J., Fendly, B. M., Chazin, 

10 V. R., Pegram, M. D., Howell, S. B. and Slamon, D. J. (1994) Oncogene, 9(7), 1829- 
38; Pegram, M. D., Finn, R. S., Arzoo, K., Beryt, M., Pietras, R. J. and Slamon, D. 
J. (1997) Oncogene, 15(5), 537-47). The amount of HER-2/ neu protein expressed, 
as determined by quantitative Western blot analysis, was approximately 1.62 
pg/cell for MCF-7/HER-2 cells and 1.14 pg/cell for the CaOv-3/HER-2 cells as 

15 compared to 0.36 pg/cell and 0.41 pg/cell for the control transfected cells 
respectively (Press, M. F., Pike, M. C, Chazin, V. R., Hung, G., Udove, J. A., 
Markowicz, M., Danyluk, J., Godolphin, W., Sliwkowski, M., Akita, R. and et aL 
(1993) Cancer Res, 53(20), 4960-70). In addition, the biologic changes induced by 
HER-2/ neu overexpression in the human ovarian cancer cells were similar to 

20 those seen and described above in the human breast cancer cells (Chazin, V.R. 
(1991). The biologic effects of HER-2///W proto-oncogene overexpression, 
Chapter 2. Department of Microbiology and Immunology, University of 
California, Los Angeles). Based on this consistent pattern of biologic changes 
induced by HER-2/ neu overexpression for both the MCF-7/HER-2 and CaOv- 

25 3/HER-2 cell lines, we would anticipate that at least some of the changes in 
mRNA expression patterns associated with HER-2 overexpression might be 
similar between these two cell lines if these genes are relevant to the HER-2/ neu 
overexpressing phenotype. Northern blot analysis of ovarian cancer cell line pair 
demonstrated that 12 of 16 (75%) clones found to be differentially expressed in 

30 MCF-7/HER-2 as compared to isogenic control breast cancer cells were also 
differentially expressed in the CaOv-3 ovarian cancer cells (Figure 15). In 
addition, for most of the clones, the degree of differential expression was 
consistent between these two distinct epithelial cell lines. This phenomenon is 
seen with clones HI 7, HI 8, H35, H37, and H41 which show marked increases in 

35 expression levels in association with HER-2/ neu overexpression and clones C49, 
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C72, H16, H45, and H63 which show more subtle differences (Figure 15). Four 
of the MCF-7 differentially expressed clones (C31, C40, H14, H31) did not 
demonstrate any noticeable difference in expression in CaOv-3/HER-2 vs. 
control cells. 

5 

13. Differential Expression of Two of the Identified cDNAs in Primary 
Human Breast Cancer Samples 

To further confirm differential expression of the novel (or 

10 uncharacterized) cDNAs in actual human malignancies, we examined a panel of 
primary human breast cancer specimens by Northern blot analyses. Among 
these, we were able to detect clear signals on the tumor Northern blots for two 
of the tested clones, H37 and H41. Less success with the other cDNAs (i.e. C40, 
HI 3, HI 4, HI 7, H63) can be best explained with their relatively rare message 

15 level. For the H37 cDNA, 15 individual cancer samples were analyzed, and 8 of 
these (#3, 4, 5, 7, 8, 12, 14, 15) overexpress HER-2/neu (Figure 16A). Seven of 
these eight tumors (88 %) demonstrated overexpression of the H37 transcript, 
while only one of seven (14%) of the non-HER-2 overexpressors overexpresses 
this cDNA (□ =0.732, p<0.005) (Figure 16A). For the H41 cDNA, 5 of 7 (71 %) 

20 of the HER-2 overexpressing malignancies overexpress the gene while two of 
eight (25 %) non-HER-2 overexpressors were found to have increased levels of 
this novel transcript (□ =0.464, p<0.075) (Figure 16B). Tumor #8 did not 
express high levels of either H37 nor H41 despite its high HER-2 expression 
level. Overall, these data suggest that the above two novel genes may be 

25 contribute in some way to the phenotype associated with HER-2 overexpression. 
Further studies of this association are currendy underway using greater numbers 
of tumor specimens. 

14, Extension of HOMPS-Encoding Polynucleotides 

30 Nucleic acid sequences of or Figure 1, Figure 3, Figure 5, Figure 7, Figure 

8 or Figure 10 can be used to design oligonucleotide primers for extending a 
partial nucleotide sequence to full length or for obtaining 5' or 3', intron or other 
control sequences from genomic libraries. One primer is synthesized to initiate 
extension in the antisense direction (XLR) and the other is synthesized to extend 

35 sequence in the sense direction (XLF). Primers are used to facilitate the extension 
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of the known sequence "outward" generating amplicons containing new, 
unknown nucleotide sequence for the region of interest. The initial primers are 
designed from the cDNA using OLIGO 4. 06 (National Biosciences), or another 
appropriate program to be 22-30 nucleotides in length, to have a GC content of 
5 50% or more, and to anneal to the target sequence at temperatures about 68 ° -72 
° C. Any stretch of nucleotides which would result in hairpin structures and 
primer-primer dimerizations is avoided. 

The original, selected cDNA libraries, or a human genomic library are 
used to extend the sequence; the latter is most useful to obtain 5' upstream 
10 regions. If more extension is necessary or desired, additional sets of primers are 
designed to further extend the known region. 

By following the instructions for the XL-PCR kit (Perkin Elmer) and 
thoroughly mixing the enzyme and reaction mix, high fidelity ampUfication is 
obtained. 

15 Beginning with 40 pmol of each primer and the recommended 

concentrations of all other components of the kit, PCR is performed using the 
Peltier Thermal Cycler (PTC200; M. J. Research, Watertown, Mass. ) and the 
following parameters: 



20 Step 1 94 °C. for 1 min (initial denaturation) 

Step 2 65 °C. fori, min 

Step 3 68 °C for 6 min 

Step 4 94 °C. for 15 sec 

Step 5 65 °C for 1 min 
25 Step 6 68°Cfor7min 

Step 7 Repeat step 4-6 for 15 additional cycles 

Step 8 94°C.forl5sec 

Step 9 65 °C. fori min 

Step 10 68 °C. for 7:15 min 
30 Step 11 Repeat step 8-10 for 12 cycles 

Step 12 72 °C for 8 min 

Step 13 4 °C (and holding) 
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A 5-10 |il aliquot of the reaction mixture is analyzed by electrophoresis on 
a low concentration (about 0. 6-0. 8%) agarose mini-gel to determine which 
reactions were successful in extending the sequence. Bands thought to contain 
the largest products are selected and removed from the gel. Further purification 
5 involves using a commercial gel extraction method such as the QIAQUICK kit 
(QIAGEN Inc. , Chatsworth, Calif.). After recovery of the DNA, Klenow 
enzyme is used to trim single-stranded, nucleotide overhangs creating blunt ends 
which facilitate religation and cloning. 

After ethanol precipitation, the products axe redissolved in 13 ul of 

10 ligation buffer, 1 ul T4-DNA ligase (15 units) and 1 ul T4 polynucleotide kinase 
are added, and the mixture is incubated at room temperature for 2-3 hours or 
overnight at 16 ° C. Competent E. coli cells (in 40 ul of appropriate media) are 
transformed with 3 ul of ligation mixture and cultured in 80 ul of SOC medium 
(Sambrook et al. , supra). After incubation for one hour at 37 ° C. , the whole 

15 transformation mixture is plated on Luria Bertani (LB)-agar (Sambrook et al, 
supra) containing 2. times. Garb. The following day, several colonies are 
randomly picked from each plate and cultured in 150 ul of liquid LB/2, times. 
Carb medium placed in an individual well of an appropriate, commercially- 
available, sterile 96-well microtiter plate. The following day, 5 ul of each 

20 overnight culture is transferred into a non-sterile 96-weil plate and after dilution 
1:10 with water, 5 ul of each sample is transferred into a PCR array. 

For PCR amplification, 18 ul of concentrated PCR reaction mix (3. 3. 
times. ) containing 4 units of rTth DNA polymerase, a vector primer, and one or 
both of the gene specific primers used for the extension reaction are added to 

25 each well. Amplification is performed using the following conditions: 



Step 1 94 °C for 60 sec 

Step 2 94 °C. for 20 sec 

Step 3 55 °C. for 30 sec 

30 Step 4 72 °C for 90 sec 

Step 5 Repeat steps 2-4 for an additional 29 cycles 

Step 6 72 °C. for 180 sec 

Step 7 4 °C (and holding) 
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Aliquots of the PCR reactions arc run on agarose gels together with 
molecular weight markers. The sizes of the PCR products are compared to the 
original partial cDNAs, and appropriate clones are selected, ligated into plasmid, 
and sequenced. 

5 

15. Labeling and Use of Hybridization Probes 

Hybridization probes derived from Figure 1, Figure 3, Figure 5, Figure 7, 
Figure 8 or Figure 10 are employed to screen cDNAs, genomic DNAs, or 
mRNAs. -Although the labeling of oligonucleotides, consisting of about 20 base- 

10 pairs, is specifically described, essentially the same procedure is used with larger 
cDNA fragments. Oligonucleotides are designed using state-of-the-art software 
such as OLIGO 4. 06 (National Biosciences), labeled by combining 50 pmol of 
each oligomer and 250 uCi of 32 P adenosine triphosphate (Amersham Pharmacia 
Biotech) and T4 polynucleotide kinase (DuPont NEN, Boston, Mass. ). The 

15 labeled oligonucleotides are substantially purified with SEPHADEX G-25 
superfine resin column (Pharmacia & Upjohn). A portion containing 10 7 counts 
per minute of each of the sense and antisense oligonucleotides is used in a typical 
membrane based hybridization analysis of human genomic DNA digested with 
one of the following endonucleases (Ase I, Bgl II, Eco RI, Pst I, Xba 1, or Pvu 

20 II; DuPont NEN). 

The DNA from each digest is fractionated on a 0. 7 percent agarose gel 
and transferred to nylon membranes (Nytran Plus, Schleicher & Schuell, 
Durham, N. H. ). Hybridization is carried out for 16 hours at 40°C. To remove 
nonspecific signals, blots are sequentially washed at room temperature under 

25 increasingly stringent conditions up to 0. 1. times, saline sodium citrate and 0. 5% 
sodium dodecyl sulfate. After XOMAT AR film Eastman Kodak Rochester, N. 
Y. ) is exposed to the blots in a Phosphoimager cassette (Molecular Dynamics, 
Sunnyvale, Calif.) for several hours, hybridization patterns are compared visually. 

30 16. Antisense or Complementary Sequences 

Antisense molecules or nucleic acid sequences complementary to the 
HOMPS-encoding sequence, or any part thereof, are used to inhibit in vivo or in 
vitro expression of naturally occurring HOMPS. Although use of antisense 
oligonucleotides, comprising about 20 base-pairs, is specifically described, 
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essentially the same procedure is used with larger cDNA fragments. An 
oligonucleotide based on the coding sequences of HOMPS, is used to inhibit 
expression of naturally occurring HOMPS. The complementary oligonucleotide 
is designed from the most unique 5' sequence and used either to inhibit 
5 transcription by preventing promoter binding to the upstream nontranslated 
sequence or translation of an HOMPS-encoding transcript by preventing the 
ribosome from binding. Using an appropriate portion of the signal and 5' 
sequence of Figure 1, Figure 3, Figure 5, Figure 8 or Figure 10, an effective 
antisense oligonucleotide includes any 15-20 nucleotides spanning the region 
10 which translates into the signal or 5' coding sequence of the polypeptide. 

17. Expression of HOMPS 

Expression of HOMPS is accomplished by subcloning the cDNAs into 
appropriate vectors and transforming the vectors into host cells. In this case, the 

15 cloning vector, pINCYl, previously used for the generation of the cDNA library 
is used to express HOMPS in E. coli. Upstream of the cloning site, this vector 
contains a promoter for P-galactosidase, followed by sequence containing the 
ammo-terminal Met, and the subsequent seven residues of P-galactosidase. 
Immediately following these eight residues is a bacteriophage promoter useful for 

20 transcription and a linker containing a number of unique restriction sites. 

Induction of an isolated, transformed bacterial strain with IPTG using 
standard methods produces a fusion protein which consists of the first eight 
residues of P-galactosidase, about 5 to 15 residues of linker, and the full length 
protein. The signal residues direct the secretion of HOMPS into the bacterial 

25 growth media which can be used directly in the following assay for activity. 

18. Production of HOMPS Specific Antibodies 

HOMPS that is substantially purified using PAGE electrophoresis 
(Sambrook, supra), or other purification techniques, is used to immunize rabbits 
30 and to produce antibodies using standard protocols. The amino acid sequence 
deduced from Figure 1, Figure 3, Figure 5, Figure 8 or Figure 10 is analyzed using 
LASERGENE software (DNASTAR Inc) to determine regions of high 
immunogenicity and a corresponding oligopolypeptide is synthesized and used to 
raise antibodies by means known to those of skill in the art. Selection of 
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appropriate epitopes, such as those near the C-terminus or in hydrophilic regions, 
is described by Ausubel et al. (supra), and others. 

Typically, the oligopeptides are 1 5 residues in length, synthesized using an 
Applied Biosystems Peptide Synthesizer Model 431A using fmoc-chemistry, and 
5 coupled to keyhole limpet hemocyanin (KLH, Sigma, St. Louis, Mo. ) by reaction 
with N-maleimidobenzoyl-N-hydroxysuccirurnide ester (MBS; Ausubel et al , 
supra). 

Rabbits are immunized with the oligopeptide-KLH complex in complete 
Freund's adjuvant. The resulting antisera are tested for antipeptide activity, for 
10 example, by binding the peptide to plastic, blocking with 1% BSA, reacting with 
rabbit antisera, washing, and reacting with radioiodinated, goat anti-rabbit IgG. 

19. Purification of Naturally Occurring HOMPS Using Specific 
Antibodies 

15 Naturally occurring or recombinant HOMPS is substantially purified by 

irnmunoaffinity chromatography using antibodies specific for HOMPS. An 
irnmunoaffinity column is constructed by covalendy coupling HOMPS antibody 
to an activated chromatographic resin, such as CnBr-activated SEPHAROSE 
(Amersham Pharmacia Biotech). After the coupling, the resin is blocked and 

20 washed according to the manufacturer's instructions. 

Media containing HOMPS is passed over the irnmunoaffinity column, 
and the column is washed under conditions that allow the preferential 
absorb ance of HOMPS (e. g. , high ionic strength buffers in the presence of 
detergent). The column is eluted under conditions that disrupt antibody/ HOMPS 

25 binding (e.g., a buffer of pH 2-3 or a high concentration of a chao trope, such as 
urea or thiocyanate ion), and HOMPS is collected. 

20. Identification of Molecules Which Interact with HOMPS 

* HOMPS or biologically active fragments thereof are labeled with 125 I 
30 Bolton-Hunter reagent (Bolton et al. (1973) Biochem. J. 133: 529). Candidate 
molecules previously arrayed in the wells of a multi-well plate are incubated with 
the labeled HOMPS, washed and any wells with labeled HOMPS complex are 
assayed. Data obtained using different concentrations of HOMPS are used to 
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calculate values for the number, affinity, and association of HOMPS with the 
candidate molecules. 

All publications and patents mentioned in the specification are herein 
5 incorporated by reference. Various modifications and variations of the described 
method arid system of the invention will be apparent to those skilled in the art 
without departing from the scope and spirit of the invention. Although the 
invention has been described in connection with specific preferred embodiments, 
it should be understood that the invention as claimed should not be unduly 
10 limited to such specific embodiments. Indeed, various modifications of the 
described modes for carrying out the invention which are obvious to those skilled 
in molecular biology or related fields are intended to be within the scope of the 
following claims. 
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CLAIMS 

I . An isolated polynucleotide that encodes a polypeptide selected from the 
group consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, 
SEQ ID NO: 9 and SEQ ID NO: 11. 

5 2. An isolated polypeptide comprising the amino acid sequence selected 
from the group consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO: 6, SEQ ID NO: 9 and SEQ ID NO: 1 1 . 

3. An isolated variant of the polynucleotide selected from the group 
. consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 

10 NO: 7, SEQ ID NO: 8 and SEQ ID NO: 10. 

4. An isolated variant of the polypeptide selected from the group consisting 
of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 9 and 
SEQ ID NO: 11. 

5. An isolated polynucleotide that hybridizes to a sequence selected from 
15 the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, 

SEQ ID NO: 7, SEQ ID NO: 8 and SEQ ID NO: 10 under highly 
stringent conditions. 

6. A polynucleotide sequence which is the complement of a polynucleotide 
of claim 1 . 

20 7. A composition comprising a polynucleotide of claim 1. 

8. A composition comprising a polypeptide of claim 2. 

9. An expression vector containing a polynucleotide of claim 1. 

10. A host cell containing the vector of claim 9. 

II. A method for producing a polypeptide comprising the amino acid 
25 sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID 

NO: 4, SEQ ID NO: 6, SEQ ID NO: 9 and SEQ ID NO: 11 comprising 
the steps of: a) culturing the host cell of claim 10 under conditions 
suitable for the expression of the polypeptide; and b) recovering the 
polypeptide from the host cell culture. 
30 12. An isolated antibody capable of binding to an epitope of a polypeptide 
set forth in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID 
NO: 9 and SEQ ID NO: 11. 
13. The antibody claim 12, wherein the antibody is monoclonal. 
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1*V A recombinant protein comprising the antigen binding region of a 
monoclonal antibody of claim 13. 

15. The antibody or fragment thereof of claim 13 or 14, or the recombinant 
protein of claim 13, which is labeled with a detectable marker. 
5 16. The antibody or fragment thereof or recombinant protein of claim 15, 
wherein the detectable marker is selected from the group consisting of a 
radioisotope, fluorescent compound, bioluminescent compound, 
chemiluminescent compound, metal chelator or enzyme. 

17. The antibody fragment of claim 13, which is an Fab, F(ab')2, Fv or Sfv 
10 fragment. 

18. A polynucleotide of claim 1 that is labeled with a detectable marker. 

19. An assay for detecting the presence of a polypeptide set forth in SEQ ID 
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 9 and SEQ ID 
NO: 1 1 in a biological sample comprising contacting the sample with an 

15 antibody of claim 12, and detecting the binding of the polypeptide in the 

sample thereto. 

20. An assay for detecting the presence of a polynucleotide as shown in SEQ 
ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID 
NO: 8 and SEQ ID NO: 10 in a biological sample comprising contacting 

20 the sample with a complementary polynucleotide of claim 6, and 

detecting the binding of the complementary polynucleotide in the sample 
thereto. 
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1 


GGCACGAGCT 


GCGATAATAG 


51 


GGGCTCAGAG 


GGGCTATGAT 


101 


GGGCCTCTTG 


ACCCGGAGGC 


151 


ACTGGGATGG 


AAAGGTGTCT 


201 


CCTGGAAGGT 


CCTGTGATCT 


251 


GCACTCGGAT 


GTGGTGATCG 


301 


CCTATTGGCT 


GAAGAAGCTG 


351 


GTGGTGGAAC 


GGGACCACAC 


401 


AGGTGGGATT 


TGTCAGCAGT 


451 


TCTTTTCAGC 


CAGCTTTCTA 


501 


GATGCTCCTC 


CCCTGGACCT 


551 


GGCTTCAGAA 


AAGGATGCTG 


601 


GGCAGGAGGG 


AGCCAAAGTT 


651 


AAGTTTCCCT 


GGATAAACAC 


701 


GGAGGACGAA 


GGTTGGTTTG 


751 


GAAAGGTCCA 


GTCCTTGGGA 


801 


TTTGTCTCTT 


CATCTCAACG 


851 


CTTGAAAAGG 


ATCCATGAAG 


901 


ACCAGCCTGT 


GGAATGCGCC 


951 


GCGCAAATCG 


CAGCACTGGC 


1001 


GCAGGGCACC 


AAGCTACCTG 


1051 


GGCACTGCCC 


CCAGGGACCA 


1101 


AGTGGAGCCT 


ATTTTCGCCG 


1151 


TCGTAGCCCC 


ACTGAGCAGG 


1201 


ACCATGATTT 


CTTCCAGGAC 


1251 


CCAGCTTTTG 


AGACTCTGAA 


1301 


CTACAACACC 


TTTGACCAGA 


1351 


TCAACATGTA 


CTTTGCTACT 


1401 


CCTGGCATTG 


GGCGAGCTGT 


1451 


GACCATCGAC 


CTGAGCCCCT 



CGAGGCAGCA GTGCAGCTTT CAGAGGGTCC 
TCGGAGGGTT CTGCCGCACG GCATGGGCCG 
CAGGCACGCG CAGAGGAGGC TTTTCTCTGG 
GAGATTAAGA AGAAGATCAA GTCGATCCTG 
. ACTGCAAGAC ACCAGCCACC TGCCTCCCGA 
TGGGAGGTGG GGTGCTTGGC TTGTCTGTGG 
GAGAGCAGAC GAGGTGCTAT TCGAGTGCTA 
GTATTCACAG GCCTCCACCG GGCTCTCAGT 
TCTCATTGCC TGAGAACATC CAGCTCTCCC 
CGGAACATCA ATGAGTACCT GGCCGTAGTC 
CCGGTTCAAC CCCTCGGGCT ACCTCTTGCT 
CAGCCATGGA GAGCAACGTG AAAGTGCAGA 
TCTCTGATGT CTCCTGATCA GCTTCGGAAC 
AGAGGGAGTG GCTTTGGCGT CTTATGGGAT 
ACCCCTGGTG TCTGCTCCAG GGGCTTCGGC 
GTCCTTTTCT GCCAGGGAGA GGTGACACGT 
CATGTTGACC ACAGATGACA AAGCGGTGGT 
TCCATGTGAA GATGGACCGC AGCCTGGAGT 
ATTGTGATCA ACGCAGCCGG AGCCTGGTCT 
TGGTGTTGGA GAGGGGCCGC CTGGCACCCT 
TGGAGCCGAG GAAAAGGTAT GTGTATGTGT 
GGCCTAGAGA CTCCGCTTGT TGCAGACACC 
GGAAGGATTA GGTAGCAACT ACCTAGGTGG 
AAGAACCGGA CCCGGCGAAC CTGGAAGTGG 
AAGGTGTGGC CCCATTTGGC CCTGAGGGTC 
GGTTCAGAGC GCCTGGGCCG GCTATTACGA 
ATGGCGTGGT GGGCCCCCAC CCGCTAGTTG 
GGCTTCAGTG GTCACGGGCT CCAGCAGGCC 
AGCAGAGATG GTACTGAAGG GCAGGTTCCA 
TCCTCTTTAC CCGCTTTTAC TTGGGAGAGA 



FIG. 1A 
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1501 AGATCCAGGA GAACAACATC ATCTGAGCAT GTGTGCTCTG CACTGGCTCC 

1551 ACTGGCTTGC ATCCTGGCTG TGTTCACAGC CTTGTTTGCT GCTTCCATCT 

. 1601 TCCCCAGTAC TGTGCCAGGC CTTCTCCCCC TCCCCAGTGT CCTCTCCTCT 

1651 CAGGCAGGCC ATTGCACCCA TATGGCTGGG CAGGCACAGG CAGTGAGGCC 

1701 GAGGCCAATA GCGAGTGATG AGCGGGATCC TAGGACTGAT CTGTAGGCCA 

1751 TGCTGATGTC ACCCACCAGG GCAATCCATC TGGAGGCCTG AGCACCCTGG 

1801 CCCAGGACTG GCTTCATCCT GGCACTGACC AGGAAAGACT GCCTCTGACC 

1851 CTCTTAGCAG ACAGAGCCCA GGCATGGGAG CACTCTGGGG CAGCCTGGCT 

1901 CAGGTTTATT GATTTTCGTC TGTTTACCCT ATCCATTAAT CAATACATGT 

1951 AATTAACTCC TAAAAAAAAA AAAAAAAAAA A 

FIG. 1B 
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1 MIRRVLPHGM GRGLLTRRPG TRRGGFSLDW DGKVSEIKKK IKSILPGRSC 

51 DLLQDTSHLP PEHSDWIVG GGVLGLSVAY WLKKLESRRG AIRVLWERD 

101 HTYSQASTGL SVGGICQQFS LPENIQLSLF SASFLRNINE YLAWDAPPL 

151 DLRFNPSGYL LLASEKDAAA MESNVKVQRQ EGAKVSLMSP DQLRNKFPWI 

201 NTEGVALASY GMEDEGWFDP WCLLQGLRRK VQSLGVLFCQ GEVTRFVSSS 

251 QRMLTTDDKA WLKRIHEVH VKMDRSLEYQ PVECAIVINA AGAWSAQIAA 

301 LAGVGEGPPG TLQGTKLPVE PRKRYVYVWH CPQGPGLETP LVADTSGAYF 

351 RREGLGSNYL GGRSPTEQEE PDPANLEVDH DFFQDKVWPH LALRVPAFET 

401 LKVQSAWAGY YDYNTFDQNG WGPHPLWN MYFATGFSGH GLQQAPGIGR 

451 AVAEMVLKGR FQTIDLSPFL FTRFYLGEKI QENNII 

FIG. 2 
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1 


GGCACGAGCG 


GGGACGGAGC 


GAGCCGGCGC 


CAGGGCCCCT 


CGGGCCGGGA 


51 


AGAGGGGAAG 


GGGAGCGAGG 


TTGATGCCCG 


GCGGAGGGGC 


GAGCGCGGCG 


101 


TCTGGCCGGC 


TTCTCACCGC 


CGCGGAGCAA 


AGAGGGTCCC 


GGGAAGCGGC 


151 


AGGGTCGGCG 


TCCAGGAGCG 


GCTTCGGGGG 


CTCCGGCGGC 


GGCAGAGGCG 


201 


GAGCAAGCGG 


CCCCGGGTCC 


GGGAGCGGAG 


GCCCTGGGGG 


CCCCGCGGGC 


251 


AGGATGAGCT 


TGACCCCGAA 


GGAGCTCTCG 


AGCCTGCTGA 


GCATCATATC 


301 


GGAGGAGGCG 


GGCGGCGGCA 


GCACCTTCGA 


GGGCCTGTCC 


ACCGCCTTCC 


351 


ACCACTACTT 


CAGCAAGGCC. 


GACCACTTCC 


GCCTGGGCTC 


GGTGCTCGTC 


401 


ATGCTGCTCC 


AGCAGCCCGA 


CCTGCTGCCT 


AGCGCGGCGC 


AGCGCCTCAC 


451 


GGCGCTCTAC 


CTGCTCTGGG 


AGATGTACCG 


CACCGAGCCG 


CTGGCCGCCA 


501 


ACCCCTTCGC 


CGCCAGCTTC 


GCGCACCTGC 


TGAACCCCGC 


GCCGCCCGCC 


551 


CGCGGCGGCC 


AGGAACCCGA 


CCGCCCTCCG 


CTCTCAGGAT 


TTTTACCTCC 


601 


TATAACTCCA 


CCAGAAAAGT 


TTTTTCTTTC 


CCAGCTGATG 


CTGGCACCCC 


651 


CACGGGAACT 


CTTCAAAAAG 


ACGCCTCGCC 


AGATTGCACT 


GATGGACGTT 


701 


GGAAACATGG 


GCCAGTCTGT 


GGACATTAGT 


GGGCTTCAGT 


TAGCCTTGGC 


751 


CGAACGCCAA 


TCTGAATTGC 


CAACGCAAAG 


CAAAGCGAGC 


TTCCCCAGTA 


801 


TTCTCAGTGA 


CCCAGACCCG 


GATTCTTCTA 


ATTCTGGATT 


TGACAGCTCA 


851 


GTTGCCTCTC 


AGATCACAGA 


AGCTTTAGTC 


AGCGGACCAA 


AGCCACCTAT 


901 


TGAAAGCCAT 


TTTCGACCAG 


AGTTTATTCG 


TCCACCGCCT 


CCACTCCACA 


951 


TTTGTGAGGA 


TGAACTTGCT 


TGGCTAAACC 


CCACGGAGCC 


TGACCACGCG 


1001 


ATCCAGTGGG 


ATAAATCGAT 


GTGTGTTAAG 


AATAGCACTG 


GTGTGGAGAT 


1051 


CAAACGAATA 


ATGGCCAAAG 


CCTTCAAAAG 


CCCCTTATCC 


TCTCCCCAAC 


1101 


AAACACAGCT 


ACTTGGTGAG 


TTGGAAAAAG 


ACCCCAAACT 


TGTCTACCAT 


1151 


ATTGGCCTCA 


CCCCAGCCAA 


ACTTCCTGAC 


CTTGTGGAAA 


ACAACCCTTT 


1201 


AGTCGCTATA 


GAAATGTTGC 


TGAAATTAAT 


GCAGTCAAGC 


CAGATCACTG 


1251 


AGTATTTCTC 


TGTCCTGGTC 


AATATGGACA 


TGTCTTTACA 


TTCAATGGAA 


1301 


GTTGTAAATC 


GACTAACTAC 


AGCTGTTGAT 


CTACCTCCTG 


AATTTATTCA 


1351 


CCTTTATATA 


TCAAATTGCA 


TCTCTACTTG 


TGAACAGATT 


AAGGATAAAT 


1401 


ATATGCAGAA 


TCGGTTGGTG 


CGTCTTGTGT 


GTGTGTTTCT 


CCAATCCTTG 


1451 


ATCCGTAACA 


AAATTATTAA 


TGTACAGGAT 


TTGTTTATAG 


AAGTGCAGGC 


1501 


ATTCTGTATT 


GAATTCAGTA 


GGATACGAGA 


AGCTGCTGGT 


CTTTTCCGGT 



FIG. 3A 
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1551 TGTTGAAGAC ATTGGATACT GGGGAAACAC CTTCTGAGAC CAAAATGTCA 

1601 AAATAATACC TCATCAGAAC CATCCCATCC ATTCACTGTT CAGCTGTACT 

1651 GTGATTTAGT TTTTACACCG TTAAAACCCT GAGTGGATTG CTTGGTTTAA 

1701 TGCATATAAA CAGTACTTTA TCTACTTAAA GCAAAAAAAA AAAAAAAAAA 

FIG. 3B 
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1 MPGGGASAAS GRLLTMEQR GSREAAGSAS RSGFGGSGGG RGGASGPGSG 

51 SGGPGGPAGR MSLTPKELSS LLSIISEEAG GGSTFEGLST AFHHYFSKAD 

101 HFRLGSVLVM LLQQPDLLPS AAQRLTALYL LWEMYRTEPL AANPFAASFA 

151 HLLNPAPPAR GGQEPDRPPL SGFLPPITPP EKFFLSQLML APPRELFKKT 

201 PRQIALMDVG NMGQSVDISG LQLALAERQS ELPTQSKASF PSILSDPDPD 

251 SSNSGFDSSV ASQITEALVS GPKPPIESHF RPEFIRPPPP LHICEDELAW 

301 LNPTEPDHAI QWDKSMCVKN STGVEIKRIM AKAFKSPLSS PQQTQLLGEL 

351 EKDPKLVYHI GLTPAKLPDL VENNPLVAIE MLLKLMQSSQ ITEYFSVLVN 

401 MDMSLHSMEV VNRLTTAVDL PPEFIHLYIS NCISTCEQIK DKYMQNRLVR 

451 LVCVFLQSLI RNKIINVQDL FIEVQAFCIE FSRIREAAGL FRLLKTLDTG 

501 ETPSETKMSK 

FIG. 4 
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1 GGCACGAGC.T GGCTCGCGCG TGCCTTTTCC CCTCAGGTTG TGGGGAGAGC 

51 GGAATCCTGC TCCGCCGTCG CAGCAGCAGC GGCAGCCCCG GCAGCCTCGG 

101 GCGACAGCGG CGGCGCGCGA GCCCCCGGGC GGACCGTACC ACCGCTCGCC 

151 AGCACGCAGG GGGAGCCGCC CGTCTCGCCG CGCACGCCTC GGCGACCCCG 

201 CGGGGCTGAG GCGTCGCCGC GCCCGGCAGC GTGAGCGCAG AGCCGGCCTC 

251 GACCCCGAGC TCGGAGCCCC GCGGGCCGCG CCCGCCGCCG GCCCCACCCA 

301 TCCGGGTCGA GGAGGCCGAG GCCATGGCTG AGACGGAGGA GCGGAGCCTG 

351 GACAACTTCT TTGCCAAGAG GGACAAGAAG AAGAAGAAGG AGCGGAGCAA 

401 CCGGGCGGCG AGTGCCGCGG GCGCAGCGGG CAGCGCCGGC GGAAGCAGTG 

451 GAGCCGCGGG TGCGGCGGGC GGCGGGGCGG GCGCGGGGAC CCGGCCGGGT 

501 GACGGCGGGA CCGCCAGCGC GGGGGCTGCG GGCCCAGGGG CCGCCACCAA 

551 GGCTGTGACG AAGGACGAAG ATGAATGGAA AGAATTGGAG CAAAAAGAGG 

601 TTGATTACAG CGGCCTCAGG GTTCAGGCAA TGCAAATAAG CAGTGAAAAG 

651 GAAGAAGACG ATAATGAAAA GAGACAAGAT CCAGGTGATA ACTGGGAAGA 

701 AGGTGGAGGT GGTGGTGGAG GTATGGAAAA ATCTTCAGGT CCCTGGAATA 

751 AAACAGCTCC AGTACAAGCA CCTCCTGCTC CAGTAATTGT TACAGAAACC 

801 CCAGAACCAG CGATGACTAG TGGTGTGTAT AGGCCTCCTG GGGCCAGGTT 

851 AACCACAACA AGGAAAACAC CACAAGGACC ACCAGAAATC TACAGTGATA 

901 CACAGTTCCC ATCCCTGCAG TCAACTGCCA AGCATGTAGA AAGCCGGAAG 

951 GATAAAGAAA TGGAGAAGAG CTTTGAAGTA GTAAGACACA AAAATAGAGG 

1001 TAGGGATGAG GTTTCAAAAA ACCAGGCCCT TAAACTTCAG CTAGACAACC 

1051 AATATGCTGT GCTTGAAAAT CAGAAAAGCA GCCACTCACA ATACAATTAA 

1101 GGAATGGGCT TTGCTAACCC TTCTGAGGTA ACTAGACTGC AGCTAACCAC 

1151 CACCAACAGC CATTCATCAT CTGATCTCTG CTGGATCTAC AGACACCGAT 

1201 GCAGACCACT CGATTTCATG ACCGGCCCTA TTGCACTATG GAAGTTAAAG 

1251 TGTCACGACT GCTCTATGCA TATTGGATTT AGGGGAATTT TCATTGTTAC 

1301 ATAAATGTGT GAACTAGTTT CAACAGTGTT CTTTCATATT TACTCTGCAA 

1351 ATACAAAAAA CCAAAACCTG CAGCCAGTGG TCATTTCAAA ATCTTTTTAT 

1401 GTTCAGATAC TGAGCCTTCA TAAGGGTTGA CTACCTCAGA TTTGCTGCAC 

1451 TCATTGTGGA CTTCATGTGG ATCACAACTT CTGGATAAGA AGATTACAAC 

1501 TATTAAGTGT CGATGTGAAC CTTGCAACCA GCTCTACTGG ATTCTTATCA 

FIG. 5A 
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1551 GAAATCCTGC ATAAAAAGTC AGCCATCTGG GTTCTGATCT GCTGTAAAAG 

1601 ATGAAGATTT AAGTGACCTT AATTAACCTG TCCTGTGCCC TACCCTTAAG 

1651 GAATACTCTC TGTAGTAGGC TGTTGTTATA TTAGACTTCC TGGAACACAC 

1701 CGCTGAAAAG AACTGATGTG TTCAGATCAT CTGTGTAGGG CTGTGATTTG 

1751 TAATTTAAAC TAATTGTATT CTGAGGTAAC CACAAAATAA ATTCAACCAA 

1801 ACTGGGGTCC ACCAAGTGGG GGAAGGGGAA GGGAGAGAAT AATCTTGGGG 

1851 GTTTTTTTTT TTGGTAATTT TTTTATTTGG ATAGTGCTTT TTTGTTTTGT 

1901 TTTTGTTCTG CATTAAGGCC TTTTTTGCTT TGACTTGAAA TAAGTTCTTT 

1951 GACAGAGCAT ATTGCTTGGT TAATTAAGTA ACCTAAAGTA TGCATTAGGA 

2001 TTGTGAAATG TCTCGTGAGT ATGCGAATCC TGAGGGTGGA ACCAMTAGQ: 

2051 CTTTGATGAA AAGGGCAGTG GATTCTGGAG GCTCTACTTC AGGTGCTGCT 

2101 ATAATGCCTC ATCTAATCAG GACTAAATTG TGTAGGAAAC TGCAGTGGGA 

2151 AGAATATGCT TTCTGCTCAG GCTAAGAGGG TCACTGATCT GTCCTTAGAA 

2201 ATTCAGAGTA ACATGAGCAA AACCTCAGCT AAAACCCATT TAAGTGGCAT 

2251 GGATTGTGCA TGATCTTTGA TAAGAATTCC TCATGTACTT GTGCCTAGTT 

2301 TTTCAAGGTA TTGGCTGTTC TATAGATGCA GTGATTGTCC CAGCTAGCTC 

2351 TGTTACCAGC CTTTTGGTGT GTGTTTATGT TCATTTGGAG AGTGAGGGCG 

2401 AAAGACAGGT GATGTAGCAC TTCTGTTTTT AATAATTATT GCTTAAAATA 

2451 CCTATTAATA GTTTTGGGTC ATTTAAAGGG ACTTGAGGAA GCTACCGAGG 

2501 ATTACAGAAG AGTGTCCACC TAACAAGATG GTCTGGCAGT TTCCTAGTTT 

2551 TGTATCTGGT TCAATAGAAA TATGTGAAAG TGGTAATGTC ATCATTTGAT 

2601 GCAGAGTCCG GGTTTCTCTA TAATAAATCC CTTTGCCAAA TGCATGAGTT 

2651 GCAGACTTGC TACTGGCAAG AGTGAAGCAA GTGGGTGAGT AAAACTATTT 

2701 TGACGTGGGA GCGTTTTCAG ATAGGAGTTT AGTCTTGACG AAAGTGTCCG 

2751 TGCAGGAATT GGACTCCGAG GAGGGTTACA GTATCTCCTG ACGGGACCTG 

2801 CCACTCGCAT CTGGGCAATG TTGACATTTG AGGTGGCAGG CAGGATGCCT 

2851 GCCTTCTAAT ATATTTGGGT GAGTAACTGA GGCAGCCAAG GGAAGGTTGA 

2901 ATGATTAAAT CAGAAATGGG ATTCTTGGTA AACTGAAGAC TTTTATTTGG 

2951 GAATGAAAAA CCTTAAAAAA ATCTCTTCAT CGTTGAACTG TGCATTTTCC 

3001 CTGCATTTTT TCCCAACAAA ATTTTGTTGG . GGGTTATGTT ACTGAAGAAT 

3051 GAACAGATGA GTAAGTGGAG GTGTTATGTA AAGGCATATT GTACTCGAAA 

3101 TCTGAAGACC TGCAGCAGAT TTAAATTACA ACTCTTGTTA TAACTTTTTA 

3151 AAAGATTGTG AAAATATCAA AATATAAATG AATCAAGTTT TAATATACTG 

3201 TATGATGGGT GGATGAGGCT GTCCATTGTA CCATTTGTTT GAATTCTCAG 

3251 GCATGGTTTG GCAGTGCAAG AATTCTGTAA CATTAACAAA TTCAATAAAA 

3301 AGTAAATATA TGGAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

FIG.5B 
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1 MAETEERSLD NFFAKRDKKK KKERSNRAAS AAGAAGSAGG SSGAAGAAGG 

51 GAGAGTRPGD GGTASAGAAG PGAATKAVTK DEDEWKELEQ KEVDYSGLRV 

101 QAMQISSEKE EDDNEKRQDP GDNWEEGGGG GGGMEKSSGP WNKTAPVQAP 

151 PAPVIVTETP EPAMTSGVYR PPGARLTTTR KTPQGPPEIY SDTQFPSLQS 

201 TAKHVESRKD KEMEKSFEW RHKNRGRDEV SKNQALKLQL DNQYAVLENQ 

251 KSSHSQYN 

FIG. 6 
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GGCAC GAGGTGGCAT AGCATAACCA CAGTAAGAAC 

AGAACAGATA TTCAGCAGAA AACTTTTTAT ACTCTAATTC TTTTTTTTTT 
TTTTTTTGAG ACAGAGTTTT AGTCTTGTTT CCCAGGCTGG AGTGCAATGG 
CACAATCTTG GCTCACTGCA ACCTCCGCCT CCTGGGTTCA GGCAATTTTC 
CTGCCTCAGC CTCCCAAGTA GCTGGGATTA CAGGCACCCA CCACCATGCC 
CAGCTAATTT TTGTATTTTT AATAGAGAGC TAATAATTGT ATATTTAATA 
AAGACGGGTT TCACCATGTT GGCCAGGCTG GTCTTGAACT CCTGACCTCA 
GGTGATCCTC CTGCATTGGC CTCCCAAAGT GCTGGAATTC CAGGCATGAG 
CCACTGCGCC CAGTCTACAC ACTAATTCTT GTTAGCCCAA CAGCTGTTCT 
GTTCTATCTA CCCCTCATTT CACGCTCAAG GAGTCATACC TAGAATAGTT 
ACACACAAGA GGGAAACTGG AAGCCAAACA CTGTACAGTA TTGTGTAGAA 
AGTCACCTCC CTACTCCTTT TATTTTACAT GAGTGGTGAT GTGTTTTGGC 
AGATGAGCTT TCAGCTGAGG CCTGATGGAA ATTGAGATAA CCTGCAAAGA 
CATAACAGTA TTTATGAGTT ATATCTTAGT TCTTGAAATT GTGGAATGCA 
TGATTGACAA TATATTTTTA ATTTTTATTT TTTCAAGTAA TACCAGTACT 
GTTTAACTAT AGCCAGAACT GGCTAAAATT TTTATATTTT CAGAGTTGAA 
GTTGGTGAAG ACATTCATGA TTTAAACACC AGATCCTGAA AGGGGTTAAA 
TCTACTTTGA AATGAATCTG CAATCAGTAT TTCAAAGCTT TTCTGGTAAT 
TTTAGTGATC TTATTTGATT AGACTTTTTC AGAAGTACTA AATAAGGAAT 
TTTAACAGGT TTTTATTMT GCACAGATAA ATAGAAGTAC AGTGAGGTCT ■ 
ATAGCCATTT TATTAAAATA GCTTAAAAGT TTGTAAAAAA ATGAATCTTT 
GTAATTACTT AATATGTTAG TTAAGAACCC GTCAAGCTTA TATTTGCTAG 
ACTTACAAAT TATTTTAAAT GCATTTATCT TTTTTGACAC TATTCAGTGG 
AATGTGTAAG CTAGCTAATT CTTGTTTTCT GATTTAAAGC ACTTTTAAAT 
CTTATCCTGC CCCCTAAAAA CAAAAGGTTT TGATCACAAG GGGAAATTTA 
AGATTGTTAA CCCTGTTTTT CAGAAGGGCT ACTGTTAATT GCACATAAAC 
ATGAAATGTG TTTTCCCCTG TGTACTAACA CATTCTAGGC AAAATTCAAA 
CTTATAGTGG TAAAGAAACA GGTTGTTCAC TTGCTGAGGT GCAAAAATTC 
TTAAGACTTC TGTTTGAAAT TGCTCAATGA CTAGGAAAAG ATGTAGTAGT 
TTACTAAAAT TGTTTTTCTA CCATATCAAA TTAAACAATT CATGCCTTTT 
TAGGGTCAGG CCTACAATGA ATAGGTATGG TGGTTTCACA GAATTTTAAA 
ATAGAGTTAA AGGGAAGTGA TGTACATTTC GGGGGCATTA GGGTAGGGAG 
ATGAATCAAA AAATACCCCT AGTAATGCTT TATATTTTAA TACTGCAAAA 
GCTTTACAAA TGGAAACCAT GCAATTACCT GCCTTAGTTC TTTTGTCATA 
AAAACAATCA CTTGGTTGGT TGTATTGTAG CTATTACTTA TACAGCAACA 
TTTCTTCAAT TAGCAGTCTA GACATTTTAT AAACAGAAAT CTTGGACCAA 
TTGATAATAT TTCTGACTGT ATTAATATTT TAGTGCTATA AAATACTATG 
TGAATCTCTT AAAAATCTGA CATTTTACAG TCTGTATTAG ACATACTGTT 
TTTATAATGT TTTACTTCTG CCTTAAGATT TAGGTTTTTT AAATGTATTT 
TTGCCCTGAA TTAAGTGTTA ATTTGATGGA AACTCTGCTT TTAAAATCAT 
CATTTACTGG GTTCTAATAA ATTAAAAATT AAACTTGTAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 

FIG. 7 
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1 GGCACGAGGT CCCACCTTGT GGAGGATGGA GGTGACCGGG GACGCCGGGG 

51 TACCAGAATC TGGCGAGATC CGGACTCTAA AGCCGTGTCT GCTGCGCCGC 

101 AACTACAGCC GCGAACAGCA CGGCGTGGCC GCCTCCTGCC TCGAAGACCT 

151 GAGGAGCAAG GCCTGTGACA TTCTGGCCAT TGATAAGTCC CTGACACCAG 

201* TCACCCTGGT CCTGGCAGAG GATGGCACCA TAGTGGATGA TGACGATTAC 

251 TTTCTGTGTC TACCTTCCAA TACTAAGTTT GTGGCATTGG CTAGTAATGA 

301 GAAATGGGCA TACAACAATT CAGATGGAGG TACAGCTTGG ATTTCCCAAG 

351 AGTCCTTTGA TGTAGATGAA ACAGACAGCG GGGCAGGGTT GAAGTGGAAG 

401 AATGTGGCCA GGCAGCTGAA AGAAGATCTG TCCAGCATCA TCCTCCTATC 

451 AGAGGAGGAC CTCCAGATGC TTGTTGACGC TCCCTGCTGA GACGTGGCTC 

• 501 AGGAACTACG. TCAGAGTTGT GCCACCGTCC AGCGGCTGCA GCACACACTC 

551 CAACAGGTGC TTGACCAAAG AGAGGAAGTG CGTCAGTCCA AGCAGCTCCT 

601 GCAGCTGTAC CTCCAGGCTT TGGAGAAAGA GGGCAGCCTC TTGTCAAAGC 

651 AGGAAGAGTC CAAAGCTGCC TTTGGTGAGG AGGTGGATGC AGTAGACACG 

701 GGTATCAGCA GAGAGACCTC CTCGGACGTT GCGCTGGCGA GCCACATCCT 

751 TACTGCACTG AGGGAGAAGC AGGCTCCAGA GCTGAGCTTA TCTAGTCAGG 

801 ATTTGGAGGT GGGCGGAAAC CAGGGTCACT GAGCTACAGA GGAGGACATG 

851 CCCTGGGATG TAGTAGTATC ATGCAGAGGT GTGTGGGCCC TTTTGTTCAC 

901 CTCTGCAGAC TGTGAATCCT AGCTGCCAGT TTGCCTATTA TATGCCAAGG 

951 CATTTGCAAA AATCTCATTA ATCTAAATCA AAATAGCTTT AAAGAAAAAT 

1001 GCAAAAAAAA AAAAAAAAAA AAAAAAA 

FIG. 8 
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1 MEVTGDAGVP ESGEIRTLKP CLLRRNYSRE QHGVAASCLE DLRSKACDIL 

51 AIDKSLTPVT LVLAEDGTIV DDDDYFLCLP SNTKFVALAS NEKWAYNNSD 
101 GGTAWISQES FDVDETDSGA GLKWKNVARQ LKEDLSSIIL LSEEDLQMLV 
151 DAPCSDLAQE LRQSCATVQR LQHTLQQVLD. QREEVRQSKQ LLQLYLQALE 
201 KEGSLLSKQE ESKAAFGEEV DAVDTGISRE TSSDVALASH ILTALREKQA 
251 PELSLSSQDL EVGGNQGH 



FIG. 9 
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1 GGCACGAGGC TAAATGTAGA CAATGGTTAG AGAAGAATTT TCCAAATGAA 

51 TTTGCAAAAC TTACTGTAGA AAATTCACCC AAACAAGAAG CTGGAATTAG 

101 TGAGGGTCAA GGAACAGCAG GGGAAGAAGA GGAGAAGAAA AAACAGAAGA 

151 GAGGTGGAAG GGGTCAAATA AAACAAAAAA AGAAGACCGT ACCACAAAAG 

201 GTTACTATAG. CCAAAATTCC CAGAGCAAAG AAGAAATATG TGACAAGAGT 

251 ATGTGGCCTT GCAACTTTTG AAATTGATCT TAAAGAAGCA CAAAGATTTT 

301 TTGCTCAAAA ATTCTCCTGT GGTGCCTCAG TAACAGGGGA GGATGAAATT 

351 ATCATTCAGG GAGATTTTAC AGATGACATA ATTGATGTCA TTCAGGAAAA 

401 ATGGCCAGAG GTAGATGATG ACAGCATCGA AGATCTTGGA GAAGTAAAGA 

451 AGTGAATTTG AAAATTTGTC TGTATTTAAT GGCCTGAACT GAGAGTTGAT 

501 ATGGCCAAAG GGAGAGAGGC CTTTTAAAAT ATATATATAT ATACACATAT 

551 ATATGTATAT ATACACATAT ATGTATGTAT ACACATATAC ACATGTATAT 

601 ATACATGTGT GTATGTATGC ATGTATATAC ATATATACAT ACACATATAT 

651 GTATACATAT ATACACATAT ATGTATACAT ATATACACAT ATATGTATAC 

701 ATATATATAT ATTCTACAGT AAAACTGTAG ACTGTCCTCG TCCTTGGCAT 

751 TTTCACTGTT CTGTACAAGG CTGCTTGTTT TTTTATTGCC AAAGTCAAAT 

801 AAACGGGAGA CTGTCATGCT CATGCATGAA TAGAATTTAG TCAAATAAAA 

851 AATTTTGGTC ATTTGGTACT GACTTTCTCT CTCTCTCTCT CTCTCTTTTT 

901 TTTTTTTGAC AGAGTCTCGC ACTGTTGCCT GGGCTGGAAT GCAGTGGTGC 

951 GATCTCGGCT CACTGCAACC TCCGCCTCCC GGGTTCAAGT GATTCTCCTG 

1001 CCTCAGCCTC CCAAGTAGGT GGGATTACAG GCGCCCGCCA CCACGCCCAG 

1051 CTAATTTTTG TATTTTTAGT AGAGATGGGG TTTCACTATG TTGGCCAGGC 

1101 TGGTCTCGAA CTCCTGACCT CGTGATTGGC CCACCTCAGC CTCCCAAAGT 

1151 GCTGGTATTA CAGATGTGAG CCACCGCACC CAGCCTGAGT TTCTCTTTCT 

1201 • CTCTTTTTAA CTTTATTTTT TGAAAAACCC GGTAGACTTT GTGGGGAGCA 

1251 TTTTTGTTGA TAATTTTACT GATCTAAAGC TGAGTGATTT TTTAAAAGAA 

1301 TTTGAATTTG GCTTCCTCAC CAGTAATATG TCTCCTTGCT TCTTTGATGT 

1351 GATAGTTTTG AGATGGGTGA GAATCTAATA GATCTGTGGT TGAATTTGCT 

1401 GTGTTGTTAT GAAGTCCACC CTGTGGGCAC AATAACATAA CTGTTGGTAG 

1451 GAGTTGTTTG AGCTATTCTG GAGATTATTT GGTAAAGTAT ACTAAAAGCC 

.1501 TTAAAACCAT GTATGTGCGC TGTTTGAACC AGTAAGCCAC TTCTTTGACA 

1551 TTAGAAGACA TTAGAAGAAA TAATCAGCCT TGCATAAAAC TTATGGATGA 

FIG. 1 0A 
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1601 AAGTATTCAT CACAATATTA TTTATAATAA AAAATTGCAA ATGTTATAAA 

1651 TGAACAATTG GGAAATGGTT AAAGAAGTGA TGGTGCATTG TGTGGTAGAA 

1701 TATTATGGAT ATGTTTAAAG AATCATATTT TCTAAGATTA TTTGGAAGCA 

1751 TGTTTGGTAA TGTCAAGTGG AGTACCCCAG ATACATTTTA GACATTTATC 

1801 GTCATCATCT GCTCTGAGTG GAAGGCCGTT CAGAGAGGCT AGAGGTTCTT 

1851 ATTCTGGCTA TAAATTATGT GAGTAAAATT GTGCTAACCA GTTAAAAGTA 

1901 CTGTACACCC ATGCTCAATA TATAGTCCTG GAAATAGCAA TTGAAACATG 

1951 TCTTCTCACA AGAGAAAATG ACAGTTTTAA TGATGTATTT GATGAATTTA 

2001 AACTTTAAGT CAGGTGCTGC AAATTGGAAA GAAGACTTGT GGTGTTTTAA 

2051 GTTGCTGTGG ACACTTTTAA GAAACTTAGA ACCCATGGAA CCCTTGTTTA 

2101 TCGCCATGCA AATTACAATC TTGAATGAGT GTTTTTTTAA AAATAAAGTA 

2151 TTAGAAAAAT GTGTAGTAAA GATGTAAAAT TAAAAAAAAA AAAAAAAAAA 

2201 AAAAAAAAAA AAAA 

FIG. 10B 
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1 HEAKCRQWLE KNFPNEFAKL TVENSPKQEA GISEGQGTAG EEEEKKKQKR 
51 GGRGQIKQKK KTVPQKVTIA KIPRAKKKYV TRVCGLATFE IDLKEAQRFF 
101 AQKFSCGASV TGEDEIIIQG DFTDDIIDVI QEKWPEVDDD SIEDLGEVKK 

FIG. 11 
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CONTROL C40 H13 H17 H41 H37 




FIG. 13 
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SEQUENCE LISTING 

<110> Dennis J. Slamon 
Juliana J. Oh 

<120> DIFFERENTIALLY EXPRESSED GENES 

ASSOCIATED WITH HER-2/NEU OVEREXPRESSION 



<130> 30448. 79USU1 

<150> US 60/157,923 
<151> 1995-10-06 

<160> 14 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1981 
<212> DNA 

<213> Homo Sapiens 

<220> 
<221> CDS 

<222> (66) . . . (1526) 
<400> 1 

ggcacgagct gcgataatag cgaggcagca gtgcagcttt cagagggtcc gggctcagag 
qggct atg att egg agg gtt ctg ccg cac ggc atg ggc egg ggc etc ttg 
Met He Arg Arg Val Leu Pro His Gly Met Gly Arg Gly Leu Leu 
■ ! 5 10 15 

acc egg agg cca ggc acg cgc aga gga ggc ttt tct ctg gac tgg gat 
Thr Arg Arg Pro Gly Thr Arg Arg Gly Gly Phe Ser Leu Asp Trp Asp 
20 25 30 

gga aag gtg tct gag att aag aag aag ate aag teg. ate ctg cct gga 
Gly Lys Val Ser Glu lie Lys Lys Lys He Lys Ser lie Leu Pro Gly 
35 40 . 45 

agg tec tgt gat eta ctg caa gac acc age cac ctg cct ccc gag cac 
Arq Ser Cys Asp Leu Leu Gin Asp Thr Ser His Leu Pro Pro Glu His 
50 55 60 

teg gat gtg gtg ate gtg gga ggt ggg gtg ctt ggc ttg tct gtg gee 
Ser Asp Val Val lie Val Gly Gly Gly Val Leu Gly Leu Ser Val Ala 
65 70 75 

tat tgg ctg aag aag ctg gag age aga cga ggt get att cga gtg eta 
Tyr Trp Leu Lys Lys Leu Glu Ser Arg Arg Gly Ala He Arg Val Leu 
80 8 5 90 95 

gtg gtg gaa egg gac cac acg tat tea cag gee tec acc ggg etc tea 
Val val Glu Arg Asp His Thr Tyr. Ser Gin Ala Ser Thr Gly Leu Ser 
100 105 11° 

gta ggt ggg att tgt cag cag ttc tea ttg cct gag aac ate cag etc 
Val Gly Gly He Cys Gin Gin Phe Ser Leu Pro Glu Asn He Gin Leu 
115 120 125 

tec etc ttt tea gee age ttt eta egg aac ate aat gag tac ctg gec 
Ser Leu Phe Ser Ala Ser Phe Leu Arg- Asn He Asn Glu Tyr Leu Ala 
130 135 140 



60 
110 



158 



206 



254 



302 



350 



398 



446 



494 



gta gtc gat get cct ccc ctg gac etc egg ttc aae ccc teg ggc tac 542 
Val Val Asp Ala Pro Pro Leu Asp Leu Arg Phe Asn Pro Ser Gly Tyr 
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145 150 155 

etc ttg ctg get tea gaa aag gat get gca gee atg gag age aac gtg 590 
Leu Leu Leu Ala Ser Glu Lys Asp Ala Ala Ala Met Glu Ser Asn Val 
160 165 170 1*75 

aaa gtg cag agg cag gag gga gee aaa gtt tct ctg atg tct cct gat 638 
Lvs Val Gin Arg Gin Glu Gly Ala Lys Val Ser Leu Met Ser Pro Asp 
. 180 185 190 

cag ctt egg aac aag ttt ccc tgg ata aac aca gag gga gtg get ttg 
Gin Leu Arg Asn Lys Phe Pro Trp He Asn Thr Glu Gly Val Ala Leu 
195 200 205 



act ctg aag gtt cag age. gee tgg gec ggc tat tac gac tac aac ace 
Thr Leu Lys Val Gin Ser Ala Trp Ala Gly Tyr Tyr Asp Tyr Asn Thr 
405 410 415 



400 



686 



830 



878 



926 



gcg tct tat ggg atg gag gac gaa ggt tgg ttt gac ccc tgg tgt ctg 734 
Ala Ser Tyr Gly Met Glu Asp Glu Gly Trp Phe Asp Pro Trp Cys Leu 
210 215 • 220 

etc cag ggg ctt egg cga aag gtc cag tec ttg gga gtc ctt ttc tgc 782 
Leu Gin Gly Leu Arg Arg Lys Val Gin Ser Leu Gly Val Leu Phe Cys 
225 230 235 

cag gga gag gtg aca cgt ttt gtc tct tea tct caa cgc atg ttg ace 
Gin Gly Glu Val Thr Arg Phe Val Ser Ser Ser Gin Arg Met Leu Thr 
240 . 245 250 255 

aca gat gac aaa gcg gtg gtc ttg aaa agg ate cat gaa gtc cat gtg 
Thr Asp Asp Lys Ala Val Val Leu Lys Arg He His Glu Val His Val 
260 265 270 

aag atg gac cgc age ctg gag tac cag cct gtg gaa tgc gec att gtg 
Lys Met Asp Arg Ser Leu Glu Tyr Gin Pro Val Glu Cys Ala He Val 
275 280 285 

ate aac gca gec gga gec tgg tct gcg caa ate gca gca ctg get ggt 974 
He Asn Ala Ala Gly Ala Trp Ser Ala Gin He Ala Ala Leu Ala Gly 
290 295 300 

gtt gga gag ggg ccg cct ggc acc ctg cag ggc ace aag eta cct gtg 1022 
Val Gly Glu Gly Pro Pro Gly Thr Leu Gin Gly Thr Lys Leu Pro Val 
305 310 315 

gag ccg agg aaa agg tat gtg tat gtg tgg cac tgc ccc cag gga cca 1070 
Glu Pro Arg Lys Arg Tyr Val Tyr Val Trp His Cys Pro Gin Gly Pro 
320 325 330 335 

ggc eta gaci act ccg ctt gtt gca gac acc agt gga gee tat ttt cgc 1118 
Gly Leu Glu Thr Pro Leu Val Ala Asp Thr Ser Gly Ala Tyr Phe Arg 
340 345 350 

egg gaa gga tta ggt age aac tac eta ggt ggt cgt age ccc act gag 1166 
Arg Glu Gly Leu Gly Ser Asn Tyr Leu Gly Gly Arg Ser Pro Thr Glu 
355 360 365 

cag gaa gaa ccg gac ccg gcg aac ctg gaa gtg gac cat gat ttc ttc 1214 
Gin Glu Glu Pro Asp Pro Ala Asn Leu Glu Val Asp His Asp Phe Phe 
370 375 380 

cag gac aag gtg tgg ccc cat ttg gec ctg agg gtc cca get ttt gag 1262 
Gin Asp Lys Val Trp Pro His Leu Ala Leu Arg Val Pro Ala Phe Glu 
385 390 395 



1310 



ttt gac cag aat ggc gtg gtg ggc ccc cac ccg eta gtt gtc aac atg 1358 
Phe Asp Gin Asn Gly Val Val Gly Pro His Pro Leu Val Val Asn Met 
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420 425 430 

tac ttt get act ggc ttc agt ggt cac ggg etc cag cag gec cct ggc 1406 
Tyr Phe Ala Thr Gly Phe Ser Gly His Gly Leu Gin Gin Ala Pro Gly 
435 440 445 

att ggg cga get gta gca gag atg gta ctg aag ggc agg ttc cag ace 1454 
He Gly Arg Ala Val Ala Glu Met Val Leu Lys Gly Arg Phe Gin Thr 
450 455 460 

ate gac ctg age ccc ttc etc ttt acc cgc ttt tac ttg gga gag aag 1502 
He Asp Leu Ser Pro Phe Leu Phe Thr Arg Phe Tyr Leu Gly Glu Lys 
465 470 475 

ate cag gag aac aac ate ate tga gcatgtgtgc tetgeactgg ctccactggc 1556 
He Gin Glu Asn Asn He He * 
480 485 

ttgcatcctg gctgtgttca cagccttgtt tgctgcttcc atcttcccca gtactgtgcc 1616 

aggccttctc cccctcccca gtgtcctctc ctctcaggca ggecattgea cccatatggc 1676 

tgggcaggca caggcagtga ggecgaggee aatagcgagt gatgageggg atcctaggac 1736 

tgatctgtag cccatgctga tgtcacccac cagggcaatc catctggagg cctgagcacc 1796 

ctggcccagg actggcttca tcctggcact gaccaggaaa gactgcctct gaccctctta 1856 

gcagacagag cccaggcatg ggagcactct ggggcagect ggctcaggtt tattgatttt 1916 

catctattta ccctatccat taatcaatac atgtaattaa ctcctaaaaa aaaaaaaaaa 1976 

1981 

aaaaa 

<210> 2 
<211> 486 
<212> PRT 

<213> Homo Sapiens 
<400> 2 

Met He Arg Arg Val Leu Pro His Gly Met Gly Arg Gly Leu Leu Thr 

I 5 10 15 

Arg Arg Pro Gly Thr Arg Arg Gly Gly Phe Ser Leu Asp Trp Asp Gly 

20 25 30 

Lys Val Ser Glu lie Lys Lys Lys He Lys Ser He Leu Pro Gly Arg 

35 40 45 

Ser Cys Asp Leu Leu Gin Asp Thr Ser His Leu Pro Pro Glu His Ser 

50 * 55 60 

Asp Val Val He Val Gly Gly Gly Val Leu Gly Leu Ser Val Ala Tyr 
65 70 75 80 

Trp Leu Lys Lys Leu Glu Ser Arg Arg Gly Ala He Arg Val Leu Val 

85 90 95 

Val Glu Arg Asp His Thr Tyr Ser Gin Ala Ser Thr Gly Leu Ser Val 

100 105 HO 

Glv Glv He Cys Gin Gin Phe Ser Leu Pro Glu Asn He Gin Leu Ser 

115 120 125 

Leu Phe Ser Ala Ser Phe Leu Arg Asn He Asn Glu Tyr Le.u Ala Val 

130 135 140 

Val Asp Ala Pro Pro Leu Asp Leu Arg Phe Asn Pro Ser Gly Tyr Leu 
145 150 155 160 

Leu Leu Al<v Ser Glu Lys Asp Ala Ala Ala Met Glu Ser Asn Val Lys 

165 170 175 

Val Gin Arg Gin Glu Gly Ala Lys Val Ser Leu Met Ser Pro Asp Gin 

180 185. 190 

Leu Arg Asn Lys Phe Pro Trp He Asn Thr Glu Gly Val Ala Leu Ala 

195 200 205 

Ser Tyr Gly Met Glu Asp Glu Gly Trp Phe Asp Pro Trp Cys Leu Leu 

210 215 220 

Gin Gly Leu Arg Arg Lys Val Gin Ser Leu Gly Val Leu Phe Cys Gin 
225 230 235 240 

Gly Glu Val Thr Arg Phe Val Ser Ser Ser Gin Arg Met Leu Thr Thr 

245 250 255 

Asp Asp Lys Ala Val Val Leu Lys Arg He His Glu Val His Val. Lys 

260 265 270 

Met Asp Arg Ser Leii Glu Tyr Gin Pro Val Glu Cys Ala lie Val He 
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275 280 285 

Asn Ala Ala Gly Ala Trp Ser Ala Gin lie Ala Ala Leu Ala Gly Val 

290 295 300 

GlV Glu Gly Pro Pro Gly Thr Leu Gin Gly Thr Lys Leu Pro Val Glu 
305 " 310 315 320 

Pro Arg Lys Arg Tyr Val Tyr Val Trp His Cys Pro Gin Gly Pro Gly 

325 . 330 335 

Leu Glu Thr Pro Leu Val Ala Asp Thr Ser Gly Ala Tyr Phe Arg Arg 

340 345 350 

Glu Gly Leu Gly Ser Asn Tyr Leu Gly Gly Arg Ser Pro Thr Glu Gin 

355 360 365 

Glu Glu Pro Asp Pro Ala Asn Leu Glu Val Asp His Asp Phe Phe Gin 

370 375 380 

Asp Lys Val Trp Pro His Leu Ala Leu Arg Val Pro Ala Phe Glu Thr 
38 5 390 395 400 

Leu Lys Val Gin Ser Ala Trp Ala Gly Tyr Tyr Asp Tyr Asn Thr Phe 

405 410 415 

Asp Gin Asn Gly Val Val Gly Pro His Pro Leu Val Val Asn Met Tyr 

420 425 430 

Phe Ala Thr Gly Phe Ser Gly His Gly Leu Gin Gin Ala Pro Gly He 

435 440 445 

Gly Arg Ala Val Ala Glu Met Val Leu Lys Gly Arg Phe Gin Thr He 

450 455" 460 

Asp Leu Ser Pro Phe Leu Phe Thr Arg Phe Tyr Leu Gly Glu Lys He 
465 470 475 480 

Gin Glu Asn Asn He He 
485 

<210> 3 
<211> 1750 
<212> DNA 
<213> Homo Sapiens 

<400> 3 

ggcacgagcg gggacggagc gagccggcgc cagggcccct cgggccggga agaggggaag 
gggagcgagg ctgatgcccg gcggaggggc gagcgcggcg tctggccggc ttctcaccgc 
cgcggagcaa agagggtccc gggaagcggc agggtcggcg tccaggagcg gcttcggggg 
ctccggcggc ggcagaggcg gagcaagcgg ccccgggtcc gggagcggag gccctggggg 
ccccgcgggc aggatgagct tgaccccgaa ggagctctcg agcctgctga gcatcatatc 
ggaggaggcg ggcggcggca gcaccttcga gggcctgtcc accgccttcc accactactt 360 
cagcaaggcc gaccacttcc gcctgggctc ggtgctcgtc atgctgctcc agcagcccga 420 
cctgctgcct agcgcggcgc agcgcctcac ggcgctctac ctgctctggg agatgtaccg 480 
caccgagccg ctggccgcca accccttcgc cgccagcttc gcgcacctgc tcaaccccgc 540 
gccgcccgcc cgcggcggcc aggaacccga ccgccctccg ctctcaggat ttttacctcc 600 
tataactcca ccagaaaagt tttttctttc ccagctgatg ctggcacccc cacgggaact 
cttcaaaaag acgcctcgcc agattgcact gatggacgtt ggaaacatgg gccagtctgt 
ggacattagt gggcttcagt tagccttggc cgaacgccaa tctgaattgc caacgcaaag 
caaagcgagc ttccccagta ttctcagtga cccagacccg gattcttcta attctggatt 
tgacagctca gttgcctctc agatcacaga agctttagtc agcggaccaa agccacctat 
tgaaagccat tttcgaccag agtttattcg tccaccgcct ccactccaca tttgtgagga 
tgaacttgct tggctaaacc ccacggagcc tgaccacgcg atccagtggg ataaatcgat 1020 
gtgtgttaag aatagcactg gtgtggagat caaacgaata atggccaaag ccttcaaaag 1080 
ccccttatcc tctccccaac aaacacagct acttggtgag ttggaaaaag accccaaact 1140 
tgtctaccat attggcctca ccccagccaa acttcctgac cttgtggaaa acaacccttt 1200 
agtcgctata gaaatgttgc tgaaattaat gcagtcaagc cagatcactg agtatttctc 1260 
tgtcctggtc aatatggaca tgtctttaca ttcaatggaa gttgtaaatc gactaactac 1320 
agctgttgat ctacctcctg aatttattca cctttatata tcaaattgca tctctacttg 1380 
tgaacagatt aaggataaat atatgcagaa tcggttggtg cgtcttgtgt gtgtgtttct 1440 
ccaatccttg atccgtaaca aaattattaa tgtacaggat ttgtttatag aagtgcaggc 1500 
attctgtatt gaattcagta ggatacgaga agctgctggt cttttccggt tgttgaagac 1560 
attggatact ggggaaacac cttctgagac caaaatgtca aaataatacc tcatcagaac 1620 
catcccatcc attcactgtt cagctgtact gtgatttagt ttttacaccg ttaaaaccct 1680 
gagtggattg cttggtttaa tgcatataaa cagtacttta tctacttaaa gcaaaaaaaa i^o 
aaaaaaaaaa 

<210> 4 
<211> 510 
<212> PRT 



60 
120 
180 
240 
300 



660 
720 
780 
840 
900 
960 
020 
1080 
1140 
1200 



1740 
1750 
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<213> Homo Sapiens 

<400> 4 ^ 

Met Pro Gly Gly Gly Ala Ser Ala Ala Ser Gly Arg Leu Leu Thr Ala 

I 5 .10 15 

Ala Glu Gin Arg Gly Ser Arg Glu Ala Ala Gly Ser Ala Ser Arg Ser 

20 25 30 

Gly Phe Gly Gly Ser Gly Gly Gly Arg Gly Gly Ala Ser Gly Pro Gly 

35 40 45 

Ser Gly Ser Gly Gly Pro Gly Gly Pro Ala Gly Arg Met Ser Leu Thr 

50 55 60 

Pro Lvs Glu Leu Ser Ser Leu Leu Ser He He Ser Glu Glu Ala Gly 
65 70 "75 80 

Glv Glv Ser Thr Phe Glu Gly Leu Ser Thr Ala Phe His His Tyr Phe 

85 90 95 

Ser Lys Ala Asp His Phe Arg Leu Gly Ser. Val Leu Val Met Leu Leu 

100 105 HO 

Gin Gin Pro Asp Leu Leu Pro Ser Ala Ala Gin Arg Leu Thr Ala Leu 

115 120 125 

Tyr Leu Leu Trp Glu Met Tyr Arg Thr Glu Pro Leu Ala Ala Asn Pro 

130 135 140 

Phe Ala Ala Ser Phe Ala His Leu Leu Asn Pro Ala Pro Pro Ala Arg 
145 150 155 160 

Glv Gly Gin Glu Pro Asp Arg Pro Pro Leu Ser Gly Phe Leu Pro Pro 

165 170 175 

He Thr Pro Pro Glu Lys Phe Phe Leu Ser Gin Leu Met Leu Ala Pro 

180 185 190 

Pro Arg Glu Leu Phe Lys Lys Thr Pro Arg Gin He Ala Leu Met Asp 

195 200 205 

Val Gly Asn Met Gly Gin Ser Val Asp He Ser Gly Leu Gin Leu Ala 

210 ' 215 220 

Leu Ala Glu Arg Gin Ser Glu Leu Pro Thr Gin Ser Lys Ala Ser Phe 
225 230 235 240 

Pro Ser He Leu Ser Asp Pro Asp Pro Asp Ser Ser Asn Ser Gly Phe 

245 250 255 

Asp Ser Ser Val Ala Ser Gin He Thr Glu Ala Leu Val Ser Gly Pro 

260 265 270 

Lvs Pro Pro He Glu Ser His Phe Arg Pro Glu Phe He Arg Pro Pro 

275 280 285 

pro Pro Leu His He Cys Glu Asp Glu Leu Ala Trp Leu Asn Pro Thr 

290 295 300 

Glu Pro Asp His Ala He Gin Trp. Asp Lys Ser Met Cys Val Lys Asn 
305 310 315 320 

Ser Thr Gly Val Glu He Lys Arg He Met Ala Lys Ala Phe Lys Ser 

325 330 335 

Pro Leu Ser Ser Pro Gin Gin Thr Gin Leu Leu Gly Glu Leu Glu Lys 

340 345 350 

Asp Pro Lys Leu Val Tyr His He Gly Leu Thr Pro Ala Lys Leu Pro 

355 360 365 

Asp Leu Val Glu Asn Asn Pro Leu Val Ala He Glu Met Leu Leu Lys 

370 375 380 

Leu Met Gin Ser Ser Gin He Thr Glu Tyr Phe Ser Val Leu Val Asn 
385 390 395 400 

Met Asp Met Ser Leu His Ser Met Glu Val Val Asn Arg Leu Thr Thr 

405 410 415 

Ala Val Asp Leu Pro Pro Glu Phe He His Leu Tyr He Ser Asn Cys 

420 425 430 

lie Ser Thr Cys Glu Gin He Lys Asp Lys Tyr Met Gin Asn Arg Leu 

435 440 445 

Val Arg Leu Val Cys Val Phe Leu Gin Ser Leu He Arg Asn Lys He 

450 455 460 

He Asn Val Gin Asp Leu Phe He Glu Val Gin Ala Phe Cys He Glu 
465 470 475 480 

Phe Ser Arg He Arg Glu Ala Ala Gly Leu Phe Arg Leu Leu Lys Thr 

4 85 4 90 4 95 

Leu Asp Thr Gly Glu Thr Pro Ser Glu Thr Lys Met Ser Lys 
500 505 510 
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<210> 5 

<211> 3346 

<212> DNA 

<213> Homo Sapiens 



<210> 6 

<211> 258 

<212> PRT 

<213> Homo Sapiens 



60 



<400> 5 . , 

ggcacgagcV. ggctcgcgcg tgccttttcc cctcaggttg tggggagagc ggaatcctgc 

tccgccgtcg cagcagcagc ggcagccccg gcagcctcgg gcgacagcgg cggcgcgcga 120 

gcccccgggc ggaccgtacc accgctcgcc agcacgcagg gggagccgcc cgtctcgccg 180 

cgcacgcctc ggcgaccccg cggggctgag gcgtcgccgc gcccggcagc gtgagcgcag 240 

agccggcctc gaccccgagc tcggagcccc gcgggccgcg cccgccgccg gccccaccca 300 

tccgggtcga ggaggccgag gccatggctg agacggagga gcggagcctg gacaacttct 360 

ttgccaagag ggacaagaag aagaagaagg agcggagcaa ccgggcggcg agtgccgcgg 420 

gcgcagcggg cagcgccggc ggaagcagtg gagccgcggg tgcggcgggc ggcggggcgg 480 

acgcggggac ccggccgggt gacggcggga ccgccagcgc gggggctgcg ggcccagggg 540 

ccgccaccaa ggctgtgacg aaggacgaag atgaatggaa agaattggag caaaaagagg 600 

ttgattacag cggcctcagg gttcaggcaa tgcaaataag cagtgaaaag gaagaagacg 660 

ataatgaaaa gagacaagat ccaggtgata actgggaaga aggtggaggt ggtggtggag 720 

gtatggaaaa atcttcaggt ccctggaata aaacagctcc agtacaagca cctcctgctc 780 

cagtaattgt tacagaaacc ccagaaccag cgatgactag tggtgtgtat aggcctcctg 840 

960 
1020 
1080 



gggccaggtt aaccacaaca aggaaaacac cacaaggacc accagaaatc tacagtgata 
cacagttccc atccctgcag tcaactgcca agcatgtaga aagccggaag gataaagaaa 
tggagaagag ctttgaagta gtaagacaca aaaatagagg tagggatgag gtttcaaaaa 
accaggccct taaacttcag ctagacaacc aatatgctgt gcttgaaaat cagaaaagca 

gccactcaca atacaattaa ggaatgggct ttgctaaccc ttctgaggta actagactgc 1140 

agctaaccac caccaacagc cattcatcat ctgatctctg ctggatctac agacaccgat 1200 

gcagaccact cgatttcatg accggcccta ttgcactatg gaagttaaag tgtcacgact 1260 

gctctatgca tattggattt aggggaattt tcattgttac ataaatgtgt gaactagttt 1320 

caacagtgtt ctttcatatt tactctgcaa atacaaaaaa ccaaaacctg cagccagtgg 1380 

tcatttcaaa atctttttat gttcagatac tgagccttca taagggttga ctacctcaga 1440 

tttgctgcac tcattgtgga cttcatgtgg atcacaactt ctggataaga agattacaac 1500 

tattaagtgt cgatgtgaac cttgcaacca gctctactgg attcttatca gaaatcctgc 1560 

ataaaaagtc agccatctgg gttctgatct gctgtaaaag atgaagattt aagtgacctt 1620 

aattaacctg tcctgtgccc tacccttaag gaatactctc tgtagtaggc tgttgttata 1680 

ttagacttcc tggaacacac cgctgaaaag aactgatgtg ttcagatcat ctgtgtaggg 1/40 

ctgtgatttg taatttaaac taattgtatt ctgaggtaac cacaaaataa attcaaccaa 1800 

actggggtcc accaagtggg ggaaggggaa gggagagaat aatcttgggg gttttttttt I860 

ttggtaattt ttttatttgg atagtgcttt tttgttttgt ttttgttctg cattaaggcc 1920 

ttttttgctt tgacttgaaa taagttcttt gacagagcat attgcttggt taattaagta 1980 

acctaaagta tgcattagga ttgtgaaatg tctcgtgagt atgccaatcc tgagggtgga 2040 

accaaatagc ctttgatgaa aagggcagtg gattctggag gctctacttc aggtgctgct 2100 

ataatgectc atctaatcag gactaaattg tgtaggaaac tgcagtggga agaatatgct 2160 

ttctgctcag gctaagaggg tcactgatct gtccttagaa attcagagta acatgagcaa 2220 

aacctcagct aaaacccatt taagtggcat ggattgtgca tgatctttga taagaattcc 2280 

tcatgtactt gtgcctagtt tttcaaggta ttggctgttc tatagatgca gtgattgtcc 2340 

cagctagctc tgttaccagc cttttggtgt gtctttatgt tcatttggag agtcagggcg 2400 

aaagacaggt gatgtagcac ttctgttttt aataattatt gcttaaaata cctattaata 2460 

gttttgggtc atttaaaggg acttgaggaa gctacccagg attacagaag agtgtccacc 2520 

taacaagatg gtctggcagt ttcctagttt tgtatctggt tcaatagaaa tatgtgaaag 2580. 

tggtaatgtc atcatttgat gcagagtccg ggtttctcta taataaatcc ctttgccaaa 2640 

tqcatgagtt gcagacttgc tactggcaag agtgaagcaa gtgggtgagt aaaactattt 2700 

tgacgtggga gcgttttcag ataggagttt agtcttgacg aaagtgtccg tgcaggaatt 2760 

ggactccgag gagggttaca gtatctcctg acgggacctg ccactcgcat ctgggcaatg 2820 
ttgacatttg aggtggcagg caggatgcct gccttctaat atatttgggt gagtaactga 2880 
qccagccaag ggaaggttga atgattaaat cagaaatggg attcttggta aactgaagac 2940 
ttttatttgg gaatgaaaaa ccttaaaaaa atctcttcat cgttgaactg tgcattttcc 3000 
ctgcattttt tcccaacaaa attttgttgg gggttatgtt actgaagaat gaacagatga 3060 
gtaagtggag gtgttatgta aaggcatatt gtactcgaaa tctgaagacc tgcagcagat 3120 
ttaaattaca actcttgtta taacttttta aaagattgtg aaaatatcaa aatataaatg 3180 
aatcaagttl. taatatactg tatgatgggt ggatgaggct gtccattgta ccatttgttt 3240 
gaattctcag gcatggtttg gcagtgcaag aattctgtaa cattaacaaa ttcaataaaa 3300 
agtaaatata tggaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa oJ4o 
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Thr Gin 
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Lys Asp 
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40 
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Ala Gly Thr 
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Gly 
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Lys 
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85 
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nJ.d 


Met 


Gin 


Arg 


Gin 
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Pro 
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Glv 
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Lys 
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Pro 


Pro 
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Thr 


Ser 


Gly Val 


165 








Lys 


Thr 


Pro 


Gin 


Ser 


Leu 


Gin 


Ser 








200 


Met 


Glu 


Lys 


Ser 






215 




Glu 


Val 


Ser 


Lys 




230 






Ala 


Val 


Leu 


Glu 


245 









Leu Asp Asn Phe 
10 

Ser Asn Arg Ala 
25 

Ser Ser Gly Ala 

Arg Pro Gly Asp 
60 

Ala Ala Thr Lys 
75 

Glu Gin Lys Glu 
90 

He Ser Ser Glu 
105 

Gly Asp Asn Trp 

Ser Ser Gly Pro 
140 

Pro Val He Val 
155 

Tyr Arg Pro Pro 
170 

Gly Pro Pro Glu 
185 

Thr Ala Lys His 

Phe Glu Val Val 
220 

Asn Gin Ala Leu 
235 

Asn Gin Lys Ser 
250 



Phe 


Ala Lys 


Arg 




15. 




Ala 


Ser Ala 


Ala 




30 




Ala Gly Ala Ala 


45 






Gly Gly Thr Ala 


Ala 


Val Thr 


Lys 






80 


Val 


Asp Tyr 


Ser 




95 




Lys 


Glu Glu 


Asp 




110 




Glu 


Glu Gly Gly 


125 






Trp Asn Lys 


Thr 


Thr 


Glu Thr 


Pro 






160 


Gly Ala Arg 


Leu 




175 




He 


Tyr Ser 


Asp 




190 




Val 


Glu Ser 


Arg 


205 






Arg 


His Lys 


Asn 


Lys 


Leu Gin 


Leu 




240 


Ser 


His Ser 


Gin 




255 





<210> 7 
<211> 2068 
<212> DNA 

<213> Homo Sapiens 
<400> 7 

ggcacgaggt ggcatagcat aaccacagta agaacagaac agatattcag cagaaaactt 
tttatactct aattcttttt tttttttttt ttgagacaga gttttagtct tgtttcccag 
gctggagtgc aatggcacaa tcttggctca ctgcaacctc cgcctcctgg gttcaggcaa 
ttttcctgcc tcagcctccc aagtagctgg gattacaggc acccaccacc atgcccagct 240 

300 
360 
420 
480 



60 
120 
180 



aatttttgta tttttaatag agagctaata attgtatatt taataaagac gggtttcacc 
atgttggcca ggctggtctt gaactcctga cctcaggtga tcctcctgca ttggcctccc 
aaagtgctgg aattccaggc atgagccact gcgcccag.tc tacacactaa ttcttgttag. 
cccaacagct gttctgttct atctacccct catttcacgc tcaaggagtc atacctagaa 
tagttacaca caagagggaa actggaagcc aaacactgta cagtattgtg tagaaagtca 540 

■ - - - - i; - *- - - - - 600 

660 



cctccctact ccttttattt tacatgagtg ctgatgtgtt ttggcagatg agctttcagc 

tgaggcctga tggaaattga gataacctgc aaagacataa cagtatttat gagttatatc 

ttaottcttg aaattgtgga atgcatgatt gacaatatat ttttaatttt tattttttca 720 

—~ J ~~ J " ' ..... — — — 780 

840 
900 
960 



agtaatacca gtactgttta actatagcca gaactggcta aaatttttat attttcagag 
ttgaagttgg tgaagacatt catgatttaa acaccagatc ctgaaagggg ttaaatctac 
tttgaaatga atctgcaatc agtatttcaa agcttttctg gtaattttag tgatcttatt 
tgattagact ttttcagaag tactaaataa ggaattttaa caggttttta ttaatgcaca 

gataaataga agtacagtga ggtctatagc cattttatta aaatagctta aaagtttgta 1020 

aaaaaatgaa tctttgtaat tacttaatat gttagttaag aacccgtcaa gcttatattt 1080 

gctagactta caaattattt taaatgcatt tatctttttt gacactattc agtggaatgt 1140 

gtaagctagc taattcttgt tttctgattt aaagcacttt taaatcttat cctgccccct 1200 

aaa.aacaaaa ggttttgatc acaaggggaa atttaagatt gttaaccctg tttttcagaa 1260 

gggctactgt taattgcaca taaacatgaa atgtgttttc ccctgtgtac taacacattc 1320 

taggcaaaat tcaaacttat agtggtaaag aaacaggttg ttcacttgct gaggtgcaaa 1380 

aattcttaag acttctgttt gaaattgctc aatgactagg aaaagatgta gtagtttact 1440 

aaaattgttt ttctaccata tcaaattaaa caattcatgc ctttttaggg tcaggcctac 1500 

aatgaatagg tatggtggtt tcacagaatt ttaaaataga gttaaaggga agtgatgtac 1560 
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tcataaaaac 
tcaattagca 
actgtattaa 
tacagtctgv 
tttttaaatg 
atcatcattt 
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cattagggta 
caaaagcttt 
aatcacttgg 
gtctagacat 
tattttagtg 
attagacata 
tatttttgcc 
actgggttct 
aaaaaaaaaa 



gggagatgaa 
acaaatggaa 
ttggttgtat 
tttataaaca 
ctataaaata 
ctgtttttat 
ctgaattaag 
aataaattaa 
aaaaaaaa 



8 

tcaaaaaata 
accatgcaat 
tgtagctatt 
gaaatcttgg 
ctatgtgaat 
aatgttttac 
tgttaatttg 
aaattaaact 



cccctagtaa 
tacctgcctt 
acttatacag 
accaattgat 
ctcttaaaaa 
ttctgcctta 
atggaaactc 
tgtaaaaaaa 



tgctttatat 
agttcttttg 
caacatttct 
aatatttctg 
tctgacattt 
agatttaggt 
tgcttttaaa 
aaaaaaaaaa 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2068 



<210> 8 

<211> 1027 

<212> DNA 

<213> Homo Sapiens 



<400> 8 

ggcacgaggt 

tggcgagatc 

cggcgtggcc 

tgataagtcc 

tgacgattac 

gaaatgggca 

tgtagatgaa 

agaagatctg 

tccctgctca 

gcacacactc 

gcagctgtac 

caaagctgcc 

ctcggacgtt 

gctgagctta 

ggaggacatg 

ctctgcagac 

aatctcatta 

aaaaaaa 



cccaccttgt 
cggactctaa 
gcctcctgcc 
ctgacaccag 
tttctgtgtc 
tacaacaatt 
acagacagcg 
tccagcatca 
gacctggctc 
caacaggtgc 
ctccaggctt 
tttggtgagg 
gcgctggcga 
tctagtcagg 
ccctgggatg 
tgtgaatcct 
atctaaatca 



ggaggatgga 
agccgtgtct 
tcgaagacct 
tcaccctggt 
taccttccaa 
cagatggagg 
gggcagggtt 
tcctcctatc 
aggaactacg 
ttgaccaaag 
tggagaaaga 
aggtggatgc 
gccacatcct 
atttggaggt 
tagtagtatc 
agctgccagt 
aaatagcttt 



ggtgaccggg 
gctgcgccgc 
gaggagcaag 
cctggcagag 
tactaagttt 
tacagcttgg 
gaagtggaag 
agaggaggac 
tcagagttgt 
agaggaagtg 
gggcagcctc 
agtagacacg 
tactgcactg 
gggcggaaac 
atgcagaggt 
ttgcctatta 
aaagaaaaat 



gacgccgggg 
aactacagcc 
gcctgtgaca 
gatggcacca 
gtggcattgg 
atttcccaag 
aatgtggcca 
ctccagatgc 
gccaccgtcc 
cgtcagtcca 
ttgtcaaagc 
ggtatcagca 
agggagaagc 
cagggtcact 
gtgtgggccc 
tatgccaagg 
gcaaaaaaaa 



taccagaatc 
gcgaacagca 
ttctggccat 
tagtggatga 
ctagtaatga 
agtcctttga 
ggcagctgaa 
ttgttgacgc 
agcggctgca 
agcagctcct 
aggaagagtc 
gagagacctc 
aggctccaga 
gagctacaga 
ttttgttcac 
catttgcaaa 
aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1027 



<210> 9 

<211> 268 

<212> PRT 

<213> Homo Sapiens 



<400> 9 

Met Glu Val Thr Gly 

1 5 
Thr Leu Lys Pro Cys 
20 

Gly Val Ala Ala Ser 
35 

lie Leu Ala lie Asp 
50 

Glu Asp Gly Thr lie 
65 

Ser Asn Thr Lys Phe 
85 

Asn Asn Ser Asp Gly 
100 

Val Asp Glu Thr Asp 

- 115 
Arg Gin Leu Lys Glu 
130 

Asp Leu Gin Met Leu 
145 

Leu Arg Gin Ser Cys 
165 

Gin Val Leu Asp Gin 
180 

Gin Leu Tyr Leu Gin 
195 

Gin Glu Glu Ser Lys 



Asp Ala Gly 

Leu Leu Arg 

Cys Leu Glu 
40 

Lys Ser Leu 

55 

Val Asp Asp 
70 

Val Ala Leu 

Gly Thr Ala 

Ser Gly Ala 
120 

Asp Leu Ser 

135 
Val Asp Ala 
150 

Ala Thr Val 

Arg Glu Glu 

Ala Leu Glu 
200 

Ala Ala Phe 



Val Pro 
10 

Arg Asn 
25 

Asp Leu 

Thr Pro 

Asp Asp 

Ala Ser 

90 
Trp lie 
105 

Gly Leu 

Ser He 

Pro Cys 

Gin Arg 
170 
Val Arg 
185 

Lys Glu 



Glu Ser Gly Glu 
Tyr Ser 
Arg Ser 



Val Thr 
60 

Tyr Phe 
75 

Asn Glu 

Ser Gin 

Lys Trp 

He Leu 
140 
Ser Asp 
155 

Leu Gin 



Arg Glu 

30 
Lys Ala 
45 

Leu Val 



Gin Ser 
Gly Ser 
Gly Glu Glu Val 



Leu Cys 

Lys Trp 

Glu Ser 
110 
Lys Asn 
125 

Leu Ser 

Leu Ala 

His Thr 

Lys Gin 
190 
Leu Leu 
205 

Asp Ala 



He Arg 
15 

Gin His 

Cys Asp 

Leu Ala 

Leu Pro 

80 
Ala Tyr 
95 

Phe Asp 

Val Ala 

Glu Glu 

Gin Glu 
160 
Leu Gin 
175 

Leu Leu 
Ser Lys 
Val Asp 
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2io 215 220 

Thr Gly He Ser Arg Glu Thr Ser Ser Asp Val Ala Leu Ala Ser His 
225 230 235 240 

He Leu Thr Ala Leu Arg Glu Lys Gin Ala Pro Glu Leu Ser Leu Ser 

245 250 255 

Ser Gin Asp Leu Glu Val Gly Gly Asn Gin Gly His 
260 265 



<210> 10 
<211> 2214 
<212> DNA 

<213> Homo Sapiens 



<400> 10 

ggcacgaggc 

ttactgtaga 

gggaagaaga 

agaagaccgt 

tgacaagagt 

ttgctcaaaa 

gagattttac 

acagcatcga 

ggcctgaact 

atacacatat 

atacatgtgt 

atacacatat 

aaaactgtag 

ttttattgcc 

tcaaataaaa 

tttttttgac 

cactgcaacc 

gggattacag 

tttcactatg 

ctcccaaagt 

ctctttttaa 

taattttact 

cagtaatatg 

gatctgtggt 

ctgttggtag 

ttaaaaccat 

ttagaagaaa 

tttataataa 

tggtgcattg 

tttggaagca 

gtcatcatct 

taaattatgt 

tatagtcctg 

tgatgtattt 

ggtgttttaa 

tcgccatgca 

gtgtagtaaa 



taaatgtaga 

aaattcaccc 

ggagaagaaa 

accacaaaag 

atgtggcctt 

attctcctgt 

agatgacata 

agatcttgga 

gagagttgat 

atatgtatat 

gtatgtatgc 

atgtatacat 

actgtcctcg 

aaagtcaaat 

aattttggtc 

agagtctcgc 

tccgcctccc 

gcgcccgcca 

ttggccaggc 

gctggtatta 

ctttattttt 

gatctaaagc 

tctccttgct 

tgaatttgct 

gagttgtttg 

gtatgtgcgc 

taatcagcct 

aaaattgcaa 

tgtggtagaa 

tgtttggtaa 

gctctgagtg 

gagtaaaatt 

jaaatagcaa 

gatgaattta 

gttgctgtgg 

aattacaatc 

gatgtaaaat 



caatggttag 

aaacaagaag 

aaacagaaga 

gttactatag 

gcaacttttg 

ggtgcctcag 

attgatgtca 

gaagtaaaga 

atggccaaag 

atacacatat 

atgtatatac 

atatacacat 

tccttggcat 

aaacgggaga 

atttggtact 

actgttgcct 

gggttcaagt 

ccacgcccag 

tggtctcgaa 

cagatgtgag 

tgaaaaaccc 

tgagtgattt 

tctttgatgt 

gtgttgttat 

agctattctg 

tgtttgaacc 

tgcataaaac 

atgttataaa 

tattatgcat 

tgtcaagtgg 

gaaggccgtt 

gtgctaacca 

ttgaaacatg 

aactttaagt 

acacttttaa 

ttgaatgagt 

taaaaaaaaa 



agaagaattt 

ctggaattag 

gaggtggaag 

ccaaaattcc 

aaattgatct 

taacagggga 

ttcaggaaaa 

agtgaatttg 

ggagagaggc 

atgtatgtat 

atatatacat 

atatgtatac 

tttcactgtt 

ctgtcatgct 

gactttctct 

gggctggaat 

gattctcctg 

ctaatttttg 

ctcctgacct 

ccaccgcacc 

ggtagacttt 

tttaaaagaa 

gatagttttg 

gaagtccacc 

gagattattt 

agtaagccac 

ttatggatga 

tgaacaattg 

atgtttaaag 

agtaccccag 

cagagaggct 

gttaaaagta 

tcttctcaca 

caggtgctgc 

gaaacttaga 

gtttttttaa 

aaaaaaaaaa 



tccaaatgaa 
tgagggtcaa 
gggtcaaata 
cagagcaaag 
taaagaagca 
ggatgaaatt 
atggccagag 
aaaatttgtc 
cttttaaaat 
acacatatac 
acacatatat 
atatatatat 
ctgtacaagg 
catgcatgaa 
ctctctctct 
gcagtggtgc 
cctcagcctc 
tatttttagt 
cgtgattggc 
cagcctgagt 
gtggggagca 
tttgaatttg 
agatgggtga 
ctgtgggcac 
ggtaaagtat 
ttctttgaca 
aagtattcat 
ggaaatggtt 
. aatcatattt 
atacatttta 
agaggttctt 
ctgtacaccc 
agagaaaatg 
aaattggaaa 
acccatggaa 
aaataaagta 
aaaaaaaaaa 



tttgcaaaac 

ggaacagcag 

aaacaaaaaa 

aagaaatatg 

caaagatttt 

atcattcagg 

gtagatgatg 

tgtatttaat 

atatatatat 

acatgtatat 

gtatacatat 

attctacagt 

ctgcttgttt 

tagaatttag 

ctctcttttt 

gatctcggct 

ccaagtaggt 

agagatgggg 

ccacctcagc 

ttctctttct 

tttttgttga 

gcttcctcac 

gaatctaata 

aataacataa 

actaaaagcc 

ttagaagaca 

cacaatatta 

aaagaagtga 

tctaagatta 

gacatttatc 

attctggcta 

atgctcaata 

acagttttaa 

gaagacttgt 

cccttgttta 

ttagaaaaat 

aaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2214 



<210> 11 
<211> 150 
<212> PRT 

<213> Homo Sapiens 



<400> 11 

His Glu Ala Lys Cys Arg Gin Trp Leu Glu Lys Asn Phe Pro Asn Glu 

1 5 10 15 

Phe Ala Lys Leu Thr Val Glu Asn Ser Pro Lys Gin Glu Ala Gly He 

20 25 30 

Ser Glu Gly Gin Gly Thr Ala Gly Glu Glu Glu Glu Lys Lys Lys Gin 

35 40 45 

Lys Arg Gly Gly Arg Gly Gin He Lys Gin Lys Lys Lys Thr Val Pro 

50 5 5 60 

Gin Lys Val Thr He Ala Lys lie Pro Arg Ala Lys Lys Lys Tyr Val 
65 70 75 80 
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Thr Arg Val Cys Gly Leu Ala Thr Phe Glu lie Asp Leu Lys Glu Ala 

85 90 95 

Gin Arg Phe Phe Ala Gin Lys Phe Ser Cys Gly Ala Ser Val Thr Gly 

100 105 110 

Glu Asp Glu He He lie Gin Gly Asp Phe Thr Asp Asp lie He Asp 

115 120 125 

Val lie Gin Glu Lys Trp Pro Glu Val Asp Asp Asp Ser He Glu Asp 

130 135 l^O 

Leu Gly Glu Val Lys Lys . 
145 150 

. <210> 12 

<211> 433 

<212> PRT 

. <213> Caenorhabditis Elegans 

Pro°TYr 2 Arg Ala Glu lie Val He He Gly Gly Gly Leu Ser Gly Ser 
1 5 10 15 

Thr Ala Phe Trp Leu Lys Glu Arg Phe Arg Asp Glu Asp Phe Lys 
20 25 30 

Val val Val Val Glu Asn Asn Asp Val Phe Thr Lys Ser Ser Thr Met 

35 40 45 

Leu Ser Thr Gly Gly He Thr Gin Gin Phe Ser He Pro Glu Phe Val 

50 55 60 

Asp Met Ser Leu Phe Thr Thr Glu Phe Leu Arg His Ala Gly Glu His 
65 70 75 80 

Leu Arg He Leu Asp Ser Glu Gin Pro Asp He Asn Phe Phe Pro Thr 

85 90 95 

Gly Tyr Leu Arg Leu Ala Lys Thr Asp Glu Glu Val Glu Met Met Arg 

100 105 HO 

Ser Ala Trp Lys Val Gin He Glu Arg Gly Ala Lys Val Gin Leu Leu 

115 120 I 25 

Ser Lys Asp Glu Leu Thr Lys Arg Tyr Pro Tyr Met Asn Val Asp Asp 

130 135 I 40 

Val Leu Leu Ala Ser Leu Gly Val Glu Asn Glu Gly Thr He Asp Thr 
145 150 155 1^0 

Trp Gin Leu Leu Ser Ala He Arg Glu Lys Asn He Thr Leu Gly Val 

F 165 170 175 

Gin Tvr Va 1 Lys Gly Glu Val Glu Gly Phe Gin Phe Glu Arg His Arg 

180 185 190 

Ala Ser Ser Glu Val His Ala Phe Gly Asp Asp Ala Thr Ala Asp Glu 

19 5 200 205 

Asn Lys Leu Arg Ala Gin Arg He Ser Gly Val Leu Val Arg Pro Gin 

210 " 215 220 

Met Asn Asp Ala Ser Ala Arg Pro lie Arg Ala His Leu lie Val Asn 
225 230 235 240 

Ala Ala Gly Pro Trp Ala Gly Gin Val Ala Lys Met Ala Gly He Gly 

245 250 255 

Lvs Glv Thr Gly Leu Leu Ala Val Pro Val Pro He Gin Pro Arg Lys 

260 265 270 

Ara Asp Val Phe Val He Phe Ala Pro Asp Val Pro Ser Asp Leu Pro 

275 280 285 

Phe He He Asp Pro Ser Thr Gly Val Phe Cys Arg Gin Thr Asp Ser 

290 295 300 

Gly Gin Thr Phe Leu Val Gly Arg Thr Pro Ser Lys Glu Glu Asp Ala 
305 310 315 • 320 

Lys Arg Asp His Ser Asn Leu Asp Val Asp Tyr Asp Asp Phe Tyr Gin 

325 330 335 

Lys He Trp Pro Val Leu Val Asp Arg Val Pro Gly Phe Gin Thr Ala 

340 345 350 

Lys Val Lys Ser Ala Trp Ser Gly Tyr Gin Asp He Asn Thr Phe Asp 

355 360 365 

Asp Ala Pro Val He Gly Glu His Pro Leu Tyr Thr Asn Leu His Met 

370 375 380 

Met Cys Gly Phe Gly Glu Arg Gly Val Met His Ser Met Ala Ala Ala 
385 390 395 400 

Arg Ala Tyr Ala Glu Arg He Phe Asp Gly Ala Tyr He Asn Val Asn 
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405 410 415 

Leu Arg Lys Phe Asp Met Arg Arg lie Val Lys Met Asp Pro lie Thr 
420 425 430 

Glu 



<210> 13 
<211> 223 
<212> PRT 

<213> Homo Sapiens 

Met°Lys 3 Val Ala Val lie Gly Gly Gly Val Ala Gly Leu Ser Ala Ala 

1 5 10 15 

Tyr Phe Leu Ala Lys Ala Gly Ala Asp Val Lys Val Phe Glu Gin Lys 

20 25 30 

Tyr Leu Leu Tyr Gly Ala Ser Gly Arg Asn Ser Gly Gly Leu Thr Ala 

35 40 45 

Gin Phe Thr Asn Glu Ala Met lie Lys Leu Ala Lys Arg Thr Leu Glu 

50 55 60 

Leu Tyr Asp Glu Leu Gin Ser Glu Val Gly Phe Asn Phe Leu Leu Arg 
65 70 75 80 

Arg Asp Gly Tyr Val Lys lie Ala Gly Lys Gly Glu Glu Ala Lys Leu 

85 90 95 

Ara Glu Glu Val Glu Phe Gin Arg Lys Ala Gly Val Lys Val Lys Met 

100 105 HO 

Val Glu Pro Glu Phe Val Lys Glu Leu Phe Pro Asp He Asn Thr Ser 

115 120 125 

Ala Phe Thr Ala Ala Ser Tyr Phe Ala Asp Gly Gly Val Val Phe Pro 

130 135 I 40 

Trp Pro Val Val Trp Gly Leu Ala Lys Gly Cys Arg Glu Leu Gly Val 
14 5 150 155 160 

Glu He Tyr Asp Tyr Thr Pro Ala Ser Val Glu Val Lys Gly Asn Asp 

165 170 175 

Leu Thr Val Lys Ala Ser Gly Glu Ser Tyr Lys Val Asp Tyr He He 

180 185 190 

Asn Ala Ala Gly Ala Trp Ser Asn Glu He Ser Gin Gin Ala Gly Val 

195 200 . 205 

Glu Leu Asn Asn Lys Val Phe Arg Glu Glu He Cys Val Thr Glu 
210 215 220 

<210> 14 
<211> 175 
<212> PRT 

<213> Drosophila Melanogaster 
<400> 14 

Leu Asp Asp Phe Phe Ala Lys Lys Asp Lys Lys Lys Ser Lys Asn Lys 

1 5 1° 15 

Thr Lys Phe Val Thr Ala Asp Glu Met Val Lys Asn Leu Glu Asp Gly 

20 25 30 

Thr Lys Arg Glu Val Val Lys Pro Lys Lys Pro Glu Ala Ala Ala Gly 

35 40 45 

Glv Val Ala Val Val Gly Glu Asn Glu Asn Ser Gly Thr Lys Val Pro " 

50 55 60 

Glu Ser Ala Pro Pro Val Glu Glu Glu Trp Lys Glu Phe Glu Glu Glu 
65 70 75 80 

Gin Arg Lys Asp Tyr Ser Gly Leu Lys He Gly Gin Leu Ser Thr He 

85 90 95 

Thr Ala Gin Glu Ser Ala Glu Ser Arg Ala Thr Arg Val Pro Thr Ala 

100 105 HO 

Gin Asp Gly Gly Asn Tyr Asp Glu Asp Asp Glu Asp Ser Asn Gly Tyr 

115 120 125 

Asp Asn Ala Asp Val Asn Lys Glu Arg Val Gly His Gly Pro Trp Lys 

130 135 - 140 

Lys Val Val Pro Ala Glu Glu Val Met Gin He Pro Val Pro Val Glu 
1^5 150 155 160 

Val Glu Lys Pro Ser Ser Lys Thr Tyr Val Ser Pro Ala Leu Arg 
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